Machine Learning – Tenosr's notebook

Gradient Boosted Decision Trees (GBDT)

Classical Algorithm, In English, Machine Learning

This notebook builds a complete understanding of GBDT from the ground up: 1. Decision Trees: The Weak Learner GBDT builds an ensemble of decision trees. Each tree partitions […]

Expected Prediction Error and the Regression Function

Base, In English, Uncategorized

Source: Elements of Statistical Learning, Section 2.4 This notebook walks through the theoretical framework for supervised learning: why squared error loss is natural, what EPE is, and why […]

Derivation of the Least Squares Solution

Base, In English

This notebook derives the ordinary least squares (OLS) solution $\hat{x} = (A^\top A)^{-1} A^\top b$ from first principles using matrix algebra. 1. Problem Setup We want to solve […]

Step by Step实现RAG

LLM

RAG(Retrieval and generation)技术可以扩展大模型的知识库，用来回答我们特定问题，这里我们Step by Step 实现RAG技术。

PRML Chapter 1

In English, Machine Learning, PRML

1.1 Example: Polynomial Curve Fitting Now suppose that we are given a training set comprising $N$ observations of $x$, written $\textbf{x} = (x_1, …, x_N)^{T}$ ,tother with corresponding […]

Mathematical notation

Book, In English, Machine Learning, PRML

Vectors are denoted by lower case bold Roman letters such as $\textbf{x}$, and all vectos are assumed to be column vectors. A superscript $T$ denotes the transpose of […]

Llama 重写日志[未完…]

Coding, LLM

很遗憾重写失败了，官方inference使用了meta之前的一个包fairscale，很麻烦，后面有大段的空闲时间的话再捡起来。

【文章发布的比较早，新版sklearn已经使用Rust重写了，只能用来凑热闹了】 sklearn中对GBDT的实现是完全遵从论文 Greedy Function Approximation的，我们一起来看一下是怎么实现的。GBDT源码最核心的部分应该是对Loss Function的处理，因为除去Loss部分的代码其他的都是非常直觉且标准的程序逻辑，反正我们就从sklearn对loss的实现开始看吧～～ Loss Function 的实现以二分类任务为例，loss采用Binomial Deviance，看这个loss很陌生，其实跟我们熟悉的negative log-likelihood / cross entropy 是一回事，因为是二分类问题嘛，模型最终输出其实就是$P(y=1|x)$，即样本$x$是正例的概率，我们把这个概率标记成$p(x)$，那么Binomial Deviance等于 $$\ell(y, F(x)) = -\left [ y\log(p(x)) + (1 – y)\log(1-p(x)) \right […]

Multi-Head Attention 计算过程

Base, LLM, Machine Learning

直觉的理解Attention和Multi-Head Attention的计算过程，然后咱们用NumPy来实现下。

XGBoost自定义目标函数

Coding, Machine Learning

xgboost内置了足够丰富的目标函数(objective function)，正常来说是能够应付日常需求的，如果～万一～你有特殊需求，它也可以自定义目标函数，或者叫损失函数(loss function)，这里介绍下怎么自定义目标函数。