public class TransformStatistics extends Object
Transform
implementations. Given a Matrix
or a Matrix
file, this class will gather row summations, column summations, and
the total summation of the matrix. Optionally, when gathering either the row
or column summations, the number of non zero values in the row or column can
be counted instead of a full summation, which is needed for the TfIdfTransform
.Modifier and Type | Class and Description |
---|---|
static class |
TransformStatistics.MatrixStatistics
A struct recording the row, column, and matrix summations as doubles.
|
Constructor and Description |
---|
TransformStatistics() |
Modifier and Type | Method and Description |
---|---|
static TransformStatistics.MatrixStatistics |
extractStatistics(File inputMatrixFile,
MatrixIO.Format format)
Extracts the full row, column, and matrix summations based on entries in
the given
Matrix file. |
static TransformStatistics.MatrixStatistics |
extractStatistics(File inputMatrixFile,
MatrixIO.Format format,
boolean countRowOccurrances,
boolean countColumnOccurrances)
Extracts the row, column, and matrix summations based on entries in
the given
Matrix . |
static TransformStatistics.MatrixStatistics |
extractStatistics(Matrix matrix)
Extracts the full row, column, and matrix summations based on entries in
the given
Matrix . |
static TransformStatistics.MatrixStatistics |
extractStatistics(Matrix matrix,
boolean countRowOccurrances,
boolean countColumnOccurrances)
Extracts the row, column, and matrix summations based on entries in
the given
Matrix . |
public static TransformStatistics.MatrixStatistics extractStatistics(Matrix matrix)
Matrix
.matrix
- a to sum over
TransformStatistics.MatrixStatistics
instance containing the summationspublic static TransformStatistics.MatrixStatistics extractStatistics(Matrix matrix, boolean countRowOccurrances, boolean countColumnOccurrances)
Matrix
. If countRowOccurrances
is true, the
number of non zeros in each row will be counted for the row summation.
If countColumnOccurrances
is true, the same will be done for the
columns. In either case, the matrix summation will remain the same.matrix
- a Matrix
to sum overcountRowOccurrances
- true if the row summation should only count
the number of non zero values in a rowcountColumnOccurrances
- true if the column summation should only
count the number of non zero values in a columnTransformStatistics.MatrixStatistics
instance containing the summationspublic static TransformStatistics.MatrixStatistics extractStatistics(File inputMatrixFile, MatrixIO.Format format)
Matrix
file.inputMatrixFfile
- a Matrix
file to sum overformat
- the matrix MatrixIO.Format
of inputMatrixFile
TransformStatistics.MatrixStatistics
instance containing the summationspublic static TransformStatistics.MatrixStatistics extractStatistics(File inputMatrixFile, MatrixIO.Format format, boolean countRowOccurrances, boolean countColumnOccurrances)
Matrix
. If countRowOccurrances
is true, the
number of non zeros in each row will be counted for the row summation.
If countColumnOccurrances
is true, the same will be done for the
columns. In either case, the matrix summation will remain the same.inputMatrixFfile
- a Matrix
file to sum overformat
- the matrix MatrixIO.Format
of inputMatrixFile
countRowOccurrances
- true if the row summation should only count
the number of non zero values in a rowcountColumnOccurrances
- true if the column summation should only
count the number of non zero values in a columnTransformStatistics.MatrixStatistics
instance containing the summationsCopyright © 2012. All Rights Reserved.