Package smile.stat
Class GoodTuring
java.lang.Object
smile.stat.GoodTuring
Good–Turing frequency estimation. This technique is for estimating the
probability of encountering an object of a hitherto unseen species,
given a set of past observations of objects from different species.
In drawing balls from an urn, the 'objects' would be balls and the
'species' would be the distinct colors of the balls (finite but
unknown number). After drawing R_red red balls, R_black black
balls and , R_green green balls, we would ask what is the probability
of drawing a red ball, a black ball, a green ball or one of a
previously unseen color.
This method takes a set of (frequency, frequency-of-frequency) pairs and estimate the probabilities corresponding to the observed frequencies, and P0, the joint probability of all unobserved species.
-
Field Summary
-
Method Summary
Modifier and TypeMethodDescriptionstatic GoodTuring
of
(int[] r, int[] Nr) Good–Turing frequency estimation.
-
Field Details
-
p
public final double[] pThe probabilities corresponding to the observed frequencies. -
p0
public final double p0The joint probability of all unobserved species.
-
-
Method Details
-
of
Good–Turing frequency estimation.- Parameters:
r
- the frequency in ascending order.Nr
- the frequency of frequencies.- Returns:
- the estimation object.
-