Smile - Home

Latest News

Smile 4.4.0 Released! (Jun 6, 2025)
Smile 4.3.0 Released! (Mar 3, 2025)
Smile 4.2.0 Released! (Feb 1, 2025)
Smile 4.1.0 Released! (Jan 12, 2025)
Smile 4.0.0 Released! (Nov 25, 2024)
Smile 3.1.1 Released! (May 22, 2024)
Smile 3.1.0 Released! (Apr 2, 2024)
Time-Series Prediction Model in Java Using the Smile Library (Oct 16, 2023, by Boqiang & Henry)
10 Best Machine Learning Libraries You Should Know in 2024 (May 16, 2023, by AlmaBetter Bytes)
Getting started with SMILE with Kotlin (Feb 23, 2021, by Magnus MacHale-Gunnarsson)
Dex Machine Learning with SMILE ML (Mar 28, 2018, by Patrick Martin)
3 projects lighting a fire under machine learning (Aug 30, 2017, by InfoWorld)
5 Machine Learning Projects You Can No Longer Overlook (Apr 19, 2017, by KDnuggets)

Built-in Algorithms:

Classification: Decision Trees, AdaBoost, Gradient Boosting, Random Forest, Logistic Regression, Neural Networks, Support Vector Machines, RBF Networks, Maximum Entropy Classifier, Generic Naïve Bayes Classifier, Naïve Bayes Document Classfier, Fisher / Linear / Quadratic / Regularized Discriminant Analysis, Platt Scaling, Isotonic Regression Scaling, One vs. One, One vs. Rest

Regression: Linear Regression, LASSO, ElasticNet, Ridge Regression, Regression Trees, Gradient Boosting, Random Forest, RBF Networks, Neural Networks, Support Vector Regression, Gaussian Process, Generalized Linear Model
Feature Engineering and Selection: Bag of Words, Sparse One Hot Encoding, Standardizer, Robust Standardizer, Maximum Absolute Value Scaler, Winsor Scaler, Normalizer, Genetic Algorithm based Feature Selection, Ensemble Learning based Feature Selection, TreeSHAP, Signal Noise ratio, Sum Squares ratio
Dimension Reduction: PCA, Kernel PCA, Probabilistic PCA, Generalized Hebbian Algorithm, Random Project, ICA
Model Validation: Cross Validation, Leave-One-Out Validation, Bootstrap, Confusion Matrix, Hyperparameter Tuning, AUC, LogLoss, CrossEntropy, Accuracy, Error, Fallout, FDR, F-Score, Precision, Recall, Sensitivity, Specificity, Matthews Correlation Coefficient, MSE, RMSE, RSS, R2, Mean Absolute Deviation, Rand Index, Adjusted Rand Index, Mutual Information Score,
Clustering: Hierarchical Clustering, CLARANS, DBSCAN, DENCLUE, K-Means, X-Means, G-Means, K-Modes, Deterministic Annealing, Sequential Information Bottleneck, Spectral Clustering, Minimum Entropy Clustering
Vector Quantization: BIRCH, Self-Organizing Maps, Neural Gas, Growing Neural Gas, Neural Map
Association Rules: Frequent Itemset Mining, Association Rule Mining
Manifold learning: IsoMap, LLE, Laplacian Eigenmap, t-SNE, UMAP, Classical MDS, Isotonic MDS, Sammon Mapping
Nearest Neighbor Search: Linear Search, BK-Tree, Cover Tree, KD-Tree, LSH, Multi-Probe LSH, SimHash
Sequence Learning: Hidden Markov Model, Conditional Random Field
Time Series: ACF, PACF, Box-Pierce and Ljung-Box Test, AR, ARMA
Natural Language Processing: Sentence Splitter, Tokenizer, Bigram Extractor, Phrase Extractor, Keyword Extractor, Porter Stemmer, Lancaster Stemmer, POS Tagging, Relevance Ranking, Word2Vec
Mathematics: Genetic Algorithms, Graph, Hash Functions, Interpolation, Sort Algorithms, Taxonomy, Wavelet
Linear Algebra: Dense Matrix, Band Matrix, Sparse Matrix, LU, Cholesky, QR, EVD, SVD, Biconjugate Gradient, BFGS, Computer Algebra System
Statistics: Distributions, Random Number Generators, Hypothesis Tests

Smile is a fast and comprehensive machine learning engine.

Smile now seems to be the go-to general-purpose machine learning library for those working in the Java and Scala worlds — a JVM Scikit-learn, if you will. I would actually find it hard to believe that you are working in that ecosystem and are unaware of the project.

- KDnuggets

Smile gives you a broad range of algorithms out of the box, ranging from simple functions like classification and regression to sophisticated offerings like natural language processing. And all you need is Java, or any JVM language.

- InfoWorld

Smile will amaze you with fast and extensive applications, efficient memory usage and a large set of machine learning algorithms for Classification, Regression, Nearest Neighbor Search, Feature Selection, etc.

- ActiveWizards

To say that I am satisfied with SMILE would be an understatement. It's truly one of the hidden gems in the Java framework ecosystem today.

- Patrick Martin, Principal Architect at Citi

LinkedIn used Smile to train its workforce on machine learning for its AI Academy. Smile was chosen because it's a Java library with a friendly open source license and supports a wide range of common algorithms.

- Ben McCann, Head of Hire Matching at LinkedIn

We leverage Smile's impressive capability in various machine learning tasks: feature engineering, modeling, visualization, benchmark test, etc. Thanks Smile and strongly recommend it to every engineer who is interested in machine learning.

- Ray Ma, Technology Manager at moKredit

SMILE is a great Java library for a wide range of AI tasks. Building bespoke methods atop SMILE run considerably faster than implementations in other languages more associated with data science.

- Shantanu Lodh, Senior Data Scientist at Hidden Depth AI, UK

Star Follow @haifengl Fork

Speed

With advanced data structures and algorithms, Smile delivers the state-of-art performance.

Compared to this third-party benchmark, Smile outperforms R, Python, Spark, H2O, xgboost significantly. Smile is several times faster than the closest competitor. The memory usage is also very efficient. If we can train advanced machine learning models on a PC, why buy a cluster?

Training Time (seconds)

Ease of Use

Write applications quickly in Java, Scala, or any JVM languages. Data scientists and developers can speak the same language now!

Smile provides hundreds advanced algorithms with clean interface. Scala/Kotlin API also offers high-level operators that make it easy to build machine learning apps. And you can use it interactively from the shell, embedded in Scala.


var iris = Read.arff("iris.arff");

var model = RandomForest.fit(Formula.lhs("class"), iris);

println(model.metrics());