MNL (Multinomial Logit)

Latest recommended article published on 2026-05-22 16:11:49


Tags: #python #machine-learning


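This article shows how to build and evaluate a multinomial logistic regression model with scikit-learn. It first uses `make_classification` to generate a multi-class dataset that includes redundant features, then configures a `LogisticRegression` model with `multi_class='multinomial'` and `solver='lbfgs'` to enable multinomial classification. The model is evaluated by cross-validation with `RepeatedStratifiedKFold`, which gives a mean classification accuracy of 68.1%. Finally, the trained model is used to predict on new data and to output the probability of each class.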
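To make the "probability of each class" concrete, here is a minimal sketch of what a multinomial logit model computes: the predicted probabilities are the softmax of the per-class linear scores. The sketch assumes scikit-learn 0.22 or later, where `LogisticRegression` with `solver='lbfgs'` fits a multinomial model by default on a problem with more than two classes, so `multi_class` is left out (scikit-learn 1.5 deprecates that parameter). The seed `random_state=0`, `max_iter=1000`, and the variable names are illustrative choices, not values from the article.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression

# Toy 3-class dataset; random_state=0 is an arbitrary choice for this sketch.
X, y = make_classification(n_samples=1000, n_features=10, n_informative=5,
                           n_redundant=5, n_classes=3, random_state=0)

# With the lbfgs solver and more than two classes, LogisticRegression
# fits a single multinomial (softmax) model by default.
model = LogisticRegression(solver="lbfgs", max_iter=1000)
model.fit(X, y)

# Linear scores, one column per class: X @ coef_.T + intercept_
scores = model.decision_function(X)

# Softmax over the class axis, shifted by the row maximum for numerical stability.
shifted = scores - scores.max(axis=1, keepdims=True)
probs_manual = np.exp(shifted) / np.exp(shifted).sum(axis=1, keepdims=True)

# The hand-computed softmax should match predict_proba.
probs_sklearn = model.predict_proba(X)
print(np.allclose(probs_manual, probs_sklearn))
print(probs_sklearn[0], probs_sklearn[0].sum())
```

Each row of `predict_proba` is a distribution over the three classes, so its entries sum to 1.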

import sklearn
import pandas as pd
from sklearn import datasets
from sklearn import linear_model
from sklearn import metrics
from sklearn.model_selection import train_test_split

from sklearn.linear_model import LogisticRegression

1. Generate the data

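n_samples: number of samples

n_features: total number of features

n_informative: number of informative features

n_redundant: number of redundant features (random linear combinations of the informative features)

n_classes: number of classes

random_state: random seed; it fixes the random number generation used to create the dataset, so the data is reproducible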
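The parameters above can be checked on a small example. The sketch below is illustrative, not the article's own code. It passes `shuffle=False` so that the columns keep their documented order (informative features first, then redundant ones), and it uses an arbitrary seed, `random_state=0`. It then confirms that the redundant columns can be reproduced as a linear combination of the informative ones.

```python
from collections import Counter

import numpy as np
from sklearn.datasets import make_classification

# shuffle=False keeps the column order: informative features first, then redundant.
# random_state=0 is an arbitrary seed chosen for this sketch.
X, y = make_classification(n_samples=1000, n_features=10, n_informative=5,
                           n_redundant=5, n_classes=3, shuffle=False,
                           random_state=0)

print(X.shape, y.shape)
print(Counter(y))  # class counts; the three classes are roughly balanced

informative = X[:, :5]
redundant = X[:, 5:]

# Least-squares fit of the redundant columns on the informative ones.
coef, *_ = np.linalg.lstsq(informative, redundant, rcond=None)
reconstructed = informative @ coef

# True if the redundant columns carry no information beyond the informative ones.
print(np.allclose(reconstructed, redundant))
```

Because half of the ten features are redundant in this configuration, the design matrix has only five linearly independent columns.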

# Generate the data
from collections import Counter
from sklearn.datasets import make_classification
# Define the dataset
X, y = make_classification(n_samples=1000, n_features=10, n_informative=5, n_redundant=5, n_classes=3