Exploring Opportunities in AI & Machine Learning


How LSTMs Work: A Deep Dive into Gates and Information Flow
Long Short-Term Memory (LSTM) networks overcome the limitations of traditional RNNs through a powerful gating mechanism. This article explains how the forget, input, and output gates work internally, breaking down the math, vector dimensions, and the intuition behind cell states and hidden states. A deep, implementation-level guide for serious deep learning practitioners.

Aryan
Feb 4
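For readers who want the gate equations in one runnable place, here is a minimal NumPy sketch of a single LSTM step. The weight layout (one matrix per gate acting on the concatenated [h, x] vector) and the tiny dimensions are illustrative assumptions, not the article's exact notation.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def lstm_cell(x_t, h_prev, c_prev, W, b):
    """One LSTM step; each gate gets its own weight matrix over [h_prev; x_t]."""
    z = np.concatenate([h_prev, x_t])
    f = sigmoid(W["f"] @ z + b["f"])   # forget gate: what to erase from c_prev
    i = sigmoid(W["i"] @ z + b["i"])   # input gate: how much candidate to write
    g = np.tanh(W["g"] @ z + b["g"])   # candidate cell state
    o = sigmoid(W["o"] @ z + b["o"])   # output gate: what to expose as h_t
    c_t = f * c_prev + i * g           # new cell state (long-term memory)
    h_t = o * np.tanh(c_t)             # new hidden state (short-term memory)
    return h_t, c_t

hidden, inp = 4, 3                     # toy sizes, chosen arbitrarily
rng = np.random.default_rng(0)
W = {k: rng.standard_normal((hidden, hidden + inp)) * 0.1 for k in "figo"}
b = {k: np.zeros(hidden) for k in "figo"}
h, c = np.zeros(hidden), np.zeros(hidden)
h, c = lstm_cell(rng.standard_normal(inp), h, c, W, b)
print(h.shape, c.shape)                # (4,) (4,)
```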


What Is LSTM? Long Short-Term Memory Explained Clearly
LSTM (Long Short-Term Memory) is a powerful neural network architecture designed to handle long-term dependencies in sequential data. In this blog, we explain LSTMs intuitively using a simple story, compare them with traditional RNNs, and break down forget, input, and output gates in a clear, beginner-friendly way.

Aryan
Feb 2
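As a quick companion to the story, a beginner can run an LSTM in a few lines of PyTorch; the sequence length, feature size, and hidden size below are arbitrary toy values.

```python
import torch
import torch.nn as nn

# Toy batch: 2 sequences, 5 timesteps, 8 features per step.
x = torch.randn(2, 5, 8)

lstm = nn.LSTM(input_size=8, hidden_size=16, batch_first=True)
out, (h_n, c_n) = lstm(x)

print(out.shape)  # (2, 5, 16): hidden state at every timestep
print(h_n.shape)  # (1, 2, 16): final hidden state (short-term memory)
print(c_n.shape)  # (1, 2, 16): final cell state (long-term memory)
```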


Problems with RNNs: Vanishing and Exploding Gradients Explained
Recurrent Neural Networks are designed for sequential data, yet they suffer from critical training issues. This article explains the vanishing gradient (long-term dependency) and exploding gradient problems in RNNs using clear intuition, mathematical insight, and practical solutions like gradient clipping and LSTMs.

Aryan
Jan 30
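Gradient clipping, one of the fixes discussed in the post, takes a single extra line in PyTorch; the model, loss, and max_norm value here are placeholder choices just to show where the call sits in a training step.

```python
import torch
import torch.nn as nn

model = nn.RNN(input_size=4, hidden_size=8, batch_first=True)
opt = torch.optim.SGD(model.parameters(), lr=0.01)

x = torch.randn(2, 50, 4)          # long sequence: 50 timesteps
out, _ = model(x)
loss = out.pow(2).mean()           # placeholder loss for illustration

opt.zero_grad()
loss.backward()
# Rescale gradients so their global norm is at most 1.0,
# preventing a single exploding update from wrecking training.
torch.nn.utils.clip_grad_norm_(model.parameters(), max_norm=1.0)
opt.step()
```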


Backpropagation Through Time (BPTT) Explained Step-by-Step with a Simple RNN Example
Backpropagation in RNNs is often confusing because a single weight affects the loss through multiple time-dependent paths. In this post, I break down Backpropagation Through Time step by step using a small toy dataset, clearly showing how gradients flow across timesteps and why unfolding the network is necessary.

Aryan
Jan 28
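The same idea can be reproduced on an even smaller scale than the post's toy dataset: a scalar RNN unrolled by hand, where the backward loop makes it explicit that the recurrent weight collects a gradient contribution at every timestep. All numbers here are made up for illustration.

```python
import numpy as np

# Toy: scalar RNN h_t = tanh(w_h * h_{t-1} + w_x * x_t), loss = (h_T - y)^2.
x = np.array([0.5, -0.1, 0.3])       # 3 timesteps
y = 0.8
w_h, w_x = 0.9, 0.7

# Forward pass: store every hidden state, since BPTT needs them all.
h = [0.0]
for x_t in x:
    h.append(np.tanh(w_h * h[-1] + w_x * x_t))

# Backward pass: walk the unrolled graph from t=T back to t=1.
dL_dh = 2 * (h[-1] - y)              # gradient at the final hidden state
g_wh = g_wx = 0.0
for t in range(len(x), 0, -1):
    dpre = dL_dh * (1 - h[t] ** 2)   # backprop through tanh
    g_wh += dpre * h[t - 1]          # w_h contributes at every timestep
    g_wx += dpre * x[t - 1]
    dL_dh = dpre * w_h               # flow to the previous timestep

print(g_wh, g_wx)
```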


Types of Recurrent Neural Networks (RNNs): Many-to-One, One-to-Many & Seq2Seq Explained
This guide explains the major types of Recurrent Neural Network (RNN) architectures based on how they map inputs to outputs. It covers Many-to-One, One-to-Many, and Many-to-Many (Seq2Seq) models, along with practical examples such as sentiment analysis, image captioning, POS tagging, NER, and machine translation, helping you understand when and why each architecture is used.

Aryan
Jan 26
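A rough sketch of how two of these mappings differ in code, assuming a PyTorch RNN with toy shapes: a Many-to-One task reads only the final hidden state, while an aligned Many-to-Many task (POS tagging, NER) attaches a prediction head to every timestep.

```python
import torch
import torch.nn as nn

rnn = nn.RNN(input_size=8, hidden_size=16, batch_first=True)
x = torch.randn(4, 10, 8)            # batch=4, 10 timesteps, 8 features
out, h_n = rnn(x)                    # out: (4, 10, 16), h_n: (1, 4, 16)

# Many-to-One (e.g. sentiment analysis): classify from the last hidden state.
sent_head = nn.Linear(16, 2)
sentiment = sent_head(h_n[-1])       # (4, 2): one prediction per sequence

# Many-to-Many, aligned (e.g. POS tagging, NER): a prediction per timestep.
tag_head = nn.Linear(16, 5)
tags = tag_head(out)                 # (4, 10, 5): one prediction per step

print(sentiment.shape, tags.shape)
```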


The Definitive Guide to Recurrent Neural Networks: Processing Sequential Data & Beyond
This definitive guide explains why sequential data requires Recurrent Neural Networks, explores the limitations of ANNs, and walks through RNN data formats, architecture, and forward propagation in detail.

Aryan
Jan 25
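The forward propagation the guide walks through reduces to one reused equation, h_t = tanh(W_xh x_t + W_hh h_{t-1} + b); here is a small NumPy sketch of that loop with arbitrary sizes.

```python
import numpy as np

rng = np.random.default_rng(0)
inp, hidden, steps = 3, 5, 4
W_xh = rng.standard_normal((hidden, inp)) * 0.1     # input-to-hidden weights
W_hh = rng.standard_normal((hidden, hidden)) * 0.1  # hidden-to-hidden weights
b = np.zeros(hidden)

xs = rng.standard_normal((steps, inp))  # sequence shaped (timesteps, features)
h = np.zeros(hidden)                    # initial hidden state
for x_t in xs:
    # The same weights are reused at every timestep; only h carries memory.
    h = np.tanh(W_xh @ x_t + W_hh @ h + b)

print(h.shape)  # (5,)
```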


Transfer Learning Explained: Overcoming Deep Learning Training Challenges
Training deep learning models from scratch is often impractical due to massive data requirements and long training times. This article explains why these challenges exist and how pretrained models and transfer learning enable faster, more efficient model development with limited data and resources.

Aryan
Jan 23
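In code, the transfer-learning recipe the article describes is short; this sketch assumes torchvision's ResNet-18 and a hypothetical 10-class target task.

```python
import torch.nn as nn
from torchvision import models

# Start from ImageNet weights instead of training from scratch.
model = models.resnet18(weights=models.ResNet18_Weights.DEFAULT)

# Freeze the pretrained feature extractor ...
for p in model.parameters():
    p.requires_grad = False

# ... and train only a new task-specific head (10 classes is an assumption).
model.fc = nn.Linear(model.fc.in_features, 10)
```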


Pretrained Models in CNN: ImageNet, AlexNet, and the Rise of Transfer Learning
Pretrained models in CNNs allow us to reuse knowledge learned from large datasets like ImageNet to build accurate computer vision systems with less data, time, and computational cost. This article explains pretrained models, ImageNet, ILSVRC, AlexNet, and the evolution of modern CNN architectures.

Aryan
Jan 21
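To try a pretrained model directly, torchvision ships an AlexNet; note its weights are a retrained variant rather than the original 2012 checkpoint, and the input size below is the standard ImageNet one.

```python
import torch
from torchvision import models

alexnet = models.alexnet(weights=models.AlexNet_Weights.DEFAULT)
alexnet.eval()

x = torch.randn(1, 3, 224, 224)       # one ImageNet-sized RGB image
with torch.no_grad():
    logits = alexnet(x)
print(logits.shape)                   # (1, 1000): one score per ImageNet class
```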


CNN vs ANN: Key Differences, Working Principles, and Parameter Comparison Explained
This blog explains the difference between Artificial Neural Networks (ANN) and Convolutional Neural Networks (CNN) using intuitive examples. It covers how images are processed, why CNNs scale better with fewer parameters, and how spatial features are preserved, making CNNs the preferred choice for image-based tasks.

Aryan
Jan 19
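The parameter comparison can be checked in a few lines; the layer sizes below (a 224x224 RGB input, 100 dense units vs. 100 3x3 filters) are arbitrary but make the scaling gap concrete.

```python
import torch.nn as nn

def n_params(m):
    return sum(p.numel() for p in m.parameters())

# ANN view: flatten a 224x224 RGB image into a 100-unit dense layer.
ann_layer = nn.Linear(224 * 224 * 3, 100)

# CNN view: the same image through 100 filters of size 3x3.
cnn_layer = nn.Conv2d(in_channels=3, out_channels=100, kernel_size=3)

print(n_params(ann_layer))  # 15,052,900: (224*224*3 + 1) * 100
print(n_params(cnn_layer))  # 2,800: (3*3*3 + 1) * 100
```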


CNN Architecture Explained: LeNet-5 Architecture with Layer-by-Layer Breakdown
This blog explains the complete CNN architecture, starting with convolution, activation, and pooling, and then diving deep into the classic LeNet-5 architecture. It covers layer-by-layer dimensions, design choices, activation functions, and why LeNet-5 became the foundation of modern convolutional neural networks.

Aryan
Jan 18
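For reference, the layer-by-layer dimensions discussed in the post line up with this compact PyTorch sketch of LeNet-5 (tanh activations as in the 1998 paper; the original subsampling layers differed slightly from plain average pooling).

```python
import torch
import torch.nn as nn

class LeNet5(nn.Module):
    def __init__(self):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(1, 6, kernel_size=5),   # C1: 32x32x1 -> 28x28x6
            nn.Tanh(),
            nn.AvgPool2d(2),                  # S2: -> 14x14x6
            nn.Conv2d(6, 16, kernel_size=5),  # C3: -> 10x10x16
            nn.Tanh(),
            nn.AvgPool2d(2),                  # S4: -> 5x5x16
        )
        self.classifier = nn.Sequential(
            nn.Flatten(),
            nn.Linear(16 * 5 * 5, 120), nn.Tanh(),  # C5
            nn.Linear(120, 84), nn.Tanh(),          # F6
            nn.Linear(84, 10),                      # output: 10 digit classes
        )

    def forward(self, x):
        return self.classifier(self.features(x))

print(LeNet5()(torch.randn(1, 1, 32, 32)).shape)  # (1, 10)
```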