logo

Machine Learning

Machine Learning is the process where a machine learns to identify features from an existing dataset so that it can accurately identify features in new data.

This definition isn’t strictly formal, and there’s no need for it to be. The term machine simply refers to a computer, or programming code. The dataset used for learning is known as a training set. If we compare machine learning to a student studying for exams, the analogy would be:

  • Machine: Student
  • Training Set: Past exam papers
  • Features: Exam patterns
  • New Data: Actual exam questions
  • Learning: The process of solving past papers to prepare for actual exam questions

As of 2021, the most commonly used method for implementing machine learning is deep learning, which involves increasing the number of hidden layers in artificial neural networks. Artificial neural networks have significantly improved in performance recently and often deliver the best results. Before deep learning showed satisfactory performance, the mainstream of machine learning was models based on statistical theories.

To become proficient in machine learning, one must have knowledge in mathematics, statistics, and programming. Mathematical and statistical knowledge is required to understand the theories, and programming skills are necessary to implement these theories. In-depth study of machine learning theories requires not only knowledge of matrix and linear algebra but also measure theory, functional analysis, and more. Recently, research combining artificial intelligence with geometry, graph theory, partial differential equations, and other fields is also underway.1

The following articles are written to be as accessible as possible for mathematics majors.

Basics

Learning Concepts

Optimization

Classical Machine Learning

Linear Regression Models

Linear Classification Models

  • Linear Classification Models
    • Least Squares Method
  • Fisher’s Linear Discriminant
  • Neyman-Pearson Criterion for Binary Classification
  • Bayes Risk Classifier

Clustering

Sampling

Reinforcement Learning

Deep Learning

Cheat Sheet: Equivalent Code in Flux, PyTorch, TensorFlow

Theory

Regularization Techniques

Various Neural Networks

PyTorch

General

Neural Networks

Tensors

Errors

Julia

Flux

References

  • Christoper M. Bishop, Pattern Recognition annd Machine Learning (2006)
  • Simon Haykin, Neural Networks and Learning Machines (3rd Edition, 2009)
  • Trevor Hastie, The Elements of Statistical Learning: Data Mining, Inference, and Prediction (2nd Edition, 2017)
  • 오일석, 기계 학습(MACHINE LEARNING) (2017)
  • Richard S. Sutton, Reinforcement Learning: An Introduction (2nd Edition, 2018)

All posts