AI / ML Engineer full Overview
Fundamentals of ML:
Introduction to AI and ML:
- What is AI, ML?
- Applications of ML
- ML Components
- Supervised Learning
- Classification,Regression
- Python for ML
Linear Regression:
- Hypothesis function
- Cost function
- Gradient Descent
- Normal Equation
- Vectorization
- Polynomial Regression
K-Nearest Neighbours:
- k-NN for classification
- Choice of k
- KD Trees
- LSH and Inverted Indices
- Instance based learning
Logistic Regression:
- Sigmoid Function
- Cost Function
- Maximum Likelihood Estimate
- GD for Logistic Regression
- Multiclass Classification
Data Preprocessing:
- Handling missing values
- One-hot Encoding
- Feature Scaling
- Feature Selection
Regularization:
Supervised Machine Learning Algorithms:
SVM:
- Maximum Margin Classifier
- Support Vectors
- Handling Outliers
- Kernels
- SVM as large margin Classifier
- Constrained Optimization
Decision Trees:
- Choosing the best attribute
- Entropy
- Information Gain
- Gini Index
- ID3,C4.5
- Handling missing values
- Pruning
- Reduced-error Pruning
- Rule post-Pruning
- Limitations of Decision Trees
Naive Bayes:
- Conditional Probability and Bayes Rule
- Naive Bayes Classifier
- Naive Bayes Algorithm
- Laplace Smoothing
- NB for text classification
- Handling Real-world Problems
Algorithms for Regression:
- Regression Trees
- Choice of Threshold & Attribute
- Overfitting
- Cost Complexity Pruning
- k-NN for Regression
- Locally Weighted Regression
Learning Theory:
- Evaluation of hypothesis function
- Tuning HyperParameters
- Bias, Variance and Noise Decomposition
- Error Analysis
- Performance Optimization
Bagging:
- Bootstrapping
- Out-of-bag Error
- Bagged Decision Trees
- Random Forests
Boosting:
- Gradient Boosting
- AdaBoost
- Stochastic Gradient Boosting
- ECOC
- Stacking
Competitive Machine Learning:
Introduction:
- Introduction to Competitive ML
- Exploratory Data Analysis
- Introduction to Pandas
- Visualizations
- Dataset Cleaning
- Introduction to scikit -learn
Advanced Pre-Processing & Feature Engineering
- Missing values Imputation
- Label Encoding
- Target Encoding
- Mean Target Encoding
- Feature Interactions
Validation
- K-fold Cross-Validation
- Validation Strategies
- Data-Splitting Strategies
- Pros and Cons of Validation
Modelling
- Hyperparameter Tuning
- Grid Search
- Advanced Model Ensembling
- Metrics Optimization
Thank you. Hope you gained knowledge from this article.