smile.classification.AbstractClassifier<int[]>

smile.classification.Maxent

All Implemented Interfaces:: Serializable, ToDoubleFunction<int[]>, ToIntFunction<int[]>, Classifier<int[]>

Direct Known Subclasses:: Maxent.Binomial, Maxent.Multinomial

public abstract class Maxent extends AbstractClassifier<int[]>

Maximum Entropy Classifier. Maximum entropy is a technique for learning probability distributions from data. In maximum entropy models, the observed data itself is assumed to be the testable information. Maximum entropy models don't assume anything about the probability distribution other than what have been observed and always choose the most uniform distribution subject to the observed constraints.

Basically, maximum entropy classifier is another name of multinomial logistic regression applied to categorical independent variables, which are converted to binary dummy variables. Maximum entropy models are widely used in natural language processing. Here, we provide an implementation which assumes that binary features are stored in a sparse array, of which entries are the indices of nonzero features.

See Also:

Nested Class Summary

Nested Classes

Modifier and Type

Class

Description

static class

Maxent.Binomial

Binomial maximum entropy classifier.

static class

Maxent.Multinomial

Multinomial maximum entropy classifier.

static final record

Maxent.Options

Maximum entropy classifier hyperparameters.

Nested classes/interfaces inherited from interface Classifier
Classifier.Trainer<T,M>
Field Summary

Fields inherited from class AbstractClassifier
classes
Constructor Summary

Constructors

Constructor

Description

Maxent(int p, double L, double lambda, IntSet labels)

Constructor.
Method Summary

Modifier and Type

Method

Description

double

AIC()

Returns the AIC score.

static Maxent.Binomial

binomial(int p, int[][] x, int[] y)

Fits maximum entropy classifier.

static Maxent.Binomial

binomial(int p, int[][] x, int[] y, Maxent.Options options)

Fits maximum entropy classifier.

int

dimension()

Returns the dimension of input space.

static Maxent

fit(int p, int[][] x, int[] y)

Fits maximum entropy classifier.

static Maxent

fit(int p, int[][] x, int[] y, Maxent.Options options)

Fits maximum entropy classifier.

double

getLearningRate()

Returns the learning rate of stochastic gradient descent.

double

loglikelihood()

Returns the log-likelihood of model.

static Maxent.Multinomial

multinomial(int p, int[][] x, int[] y)

Fits maximum entropy classifier.

static Maxent.Multinomial

multinomial(int p, int[][] x, int[] y, Maxent.Options options)

Fits maximum entropy classifier.

boolean

online()

Returns true if this is an online learner.

void

setLearningRate(double rate)

Sets the learning rate of stochastic gradient descent.

boolean

soft()

Returns true if this is a soft classifier that can estimate the posteriori probabilities of classification.

Methods inherited from class AbstractClassifier
classes, numClasses

Methods inherited from class Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait

Methods inherited from interface Classifier
applyAsDouble, applyAsInt, predict, predict, predict, predict, predict, predict, predict, predict, score, update, update, update

Constructor Details
- Maxent
  
  public Maxent(int p, double L, double lambda, IntSet labels)
  
  Constructor.
  
  Parameters:
  
  p - the dimension of input data.
  
  L - the log-likelihood of learned model.
  
  lambda - lambda > 0 gives a "regularized" estimate of linear weights which often has superior generalization performance, especially when the dimensionality is high.
  
  labels - the class label encoder.
Method Details
- fit
  
  public static Maxent fit(int p, int[][] x, int[] y)
  
  Fits maximum entropy classifier.
  
  Parameters:
  
  p - the dimension of feature space.
  
  x - training samples. Each sample is represented by a set of sparse binary features. The features are stored in an integer array, of which are the indices of nonzero features.
  
  y - training labels in [0, k), where k is the number of classes.
  
  Returns:
  
  the model.
- fit
  
  public static Maxent fit(int p, int[][] x, int[] y, Maxent.Options options)
  
  Fits maximum entropy classifier.
  
  Parameters:
  
  p - the dimension of feature space.
  
  x - training samples. Each sample is represented by a set of sparse binary features. The features are stored in an integer array, of which are the indices of nonzero features.
  
  y - training labels in [0, k), where k is the number of classes.
  
  options - the hyperparameters.
  
  Returns:
  
  the model.
- binomial
  
  public static Maxent.Binomial binomial(int p, int[][] x, int[] y)
  
  Fits maximum entropy classifier.
  
  Parameters:
  
  p - the dimension of feature space.
  
  x - training samples. Each sample is represented by a set of sparse binary features. The features are stored in an integer array, of which are the indices of nonzero features.
  
  y - training labels in [0, k), where k is the number of classes.
  
  Returns:
  
  the model.
- binomial
  
  public static Maxent.Binomial binomial(int p, int[][] x, int[] y, Maxent.Options options)
  
  Fits maximum entropy classifier.
  
  Parameters:
  
  p - the dimension of feature space.
  
  x - training samples. Each sample is represented by a set of sparse binary features. The features are stored in an integer array, of which are the indices of nonzero features.
  
  y - training labels in [0, k), where k is the number of classes.
  
  options - the hyperparameters.
  
  Returns:
  
  the model.
- multinomial
  
  public static Maxent.Multinomial multinomial(int p, int[][] x, int[] y)
  
  Fits maximum entropy classifier.
  
  Parameters:
  
  p - the dimension of feature space.
  
  x - training samples. Each sample is represented by a set of sparse binary features. The features are stored in an integer array, of which are the indices of nonzero features.
  
  y - training labels in [0, k), where k is the number of classes.
  
  Returns:
  
  the model.
- multinomial
  
  public static Maxent.Multinomial multinomial(int p, int[][] x, int[] y, Maxent.Options options)
  
  Fits maximum entropy classifier.
  
  Parameters:
  
  p - the dimension of feature space.
  
  x - training samples. Each sample is represented by a set of sparse binary features. The features are stored in an integer array, of which are the indices of nonzero features.
  
  y - training labels in [0, k), where k is the number of classes.
  
  options - the hyperparameters.
  
  Returns:
  
  the model.
- dimension
  
  public int dimension()
  
  Returns the dimension of input space.
  
  Returns:
  
  the dimension of input space.
- soft
  
  public boolean soft()
  
  Description copied from interface: Classifier
  
  Returns true if this is a soft classifier that can estimate the posteriori probabilities of classification.
  
  Returns:
  
  true if soft classifier.
- online
  
  public boolean online()
  
  Description copied from interface: Classifier
  
  Returns true if this is an online learner.
  
  Returns:
  
  true if online learner.
- setLearningRate
  
  public void setLearningRate(double rate)
  
  Sets the learning rate of stochastic gradient descent. It is a good practice to adapt the learning rate for different data sizes. For example, it is typical to set the learning rate to eta/n, where eta is in [0.1, 0.3] and n is the size of the training data.
  
  Parameters:
  
  rate - the learning rate.
- getLearningRate
  
  public double getLearningRate()
  
  Returns the learning rate of stochastic gradient descent.
  
  Returns:
  
  the learning rate of stochastic gradient descent.
- loglikelihood
  
  public double loglikelihood()
  
  Returns the log-likelihood of model.
  
  Returns:
  
  the log-likelihood of model.
- AIC
  
  public double AIC()
  
  Returns the AIC score.
  
  Returns:
  
  the AIC score.

Class Maxent

References

Nested Class Summary

Nested classes/interfaces inherited from interface Classifier

Field Summary

Fields inherited from class AbstractClassifier

Constructor Summary

Method Summary

Methods inherited from class AbstractClassifier

Methods inherited from class Object

Methods inherited from interface Classifier

Constructor Details

Maxent

Method Details

fit

fit

binomial

binomial

multinomial

multinomial

dimension

soft

online

setLearningRate

getLearningRate

loglikelihood

AIC