Blog posts

2024

Boruta Feature Selection

1 minute read

Published:

Purpose

Boruta is designed to determine which variables (features) are significant in predicting the outcome with the given dataset. It is particularly useful when dealing with high-dimensional data.

Kernel Density Estimation

1 minute read

Published:

It has been a while since I last took some learning notes - I have been buried in work and trying to figure out a balance, and wasting my time …

Convolutional Neural Networks

less than 1 minute read

Published:

什么是卷积神经网络(CNN)?它在计算机视觉中的应用是什么?

Basic pandas

less than 1 minute read

Published:

Python, data science, pandas

Basic numpy

less than 1 minute read

Published:

Python, data science, numpy

Happy 31th birthday

less than 1 minute read

Published:

今天不谈机器学习,不再整理一些记不住的知识,而是在三十又一的当下,记录一下最近的思考。

Clustering alrogithms

less than 1 minute read

Published:

gradient descent, stochastic gradient descent, batch gradient descent

Activation Functions

1 minute read

Published:

激活函数(Activation Function),负责将神经元的输入映射到输出端,激活函数将神经网络中将输入信号的总和转换为输出信号。激活函数大多是非线性函数,才能将多层感知机的输出转换为非线性,使得神经网络可以任意逼近任何非线性函数,进而可以应用到众多的非线性模型中。

Loss Functions

1 minute read

Published:

Regression Loss Functions

Time-series Forecasting

5 minute read

Published:

ARIMA (Autoregressive Integrated Moving Average)

It is a popular statistical analysis model used for forecasting time series data. ARIMA models are especially well-suited for short to medium-term forecasting models that have data with trends, seasonality, or cyclic patterns. The model aims to describe the autocorrelations in the data.