機器學習常用數據集大全


機器學習常用數據集大全


UCI Machine Learning Adult Dataset

Business Problem: Classification (a person earns more than 50k or less) Predictor Variable: Label ; Predictors: country, age, education, occupation, marital status etc.

文章:https://towardsdatascience.com/pandas-index-explained-b131beaf6f7b
數據集地址:https://archive.ics.uci.edu/ml/machine-learning-databases/adult/adult.data

Kaggle – Avazu:Click-Through Rate Prediction

Predict whether a mobile ad will be clicked
In online advertising, click-through rate (CTR) is a very important metric for evaluating ad performance. As a result, click prediction systems are essential and widely used for sponsored search and real-time bidding.

Kaggle地址:
https://www.kaggle.com/c/avazu-ctr-prediction/overview

UCI – Adult Data Set $50k

Predict whether income exceeds $50K/yr based on census data. Also known as “Census Income” dataset
https://archive.ics.uci.edu/ml/datasets/Adult

UCI – Iris Data Set

This is perhaps the best known database to be found in the pattern recognition literature. Fisher’s paper is a classic in the field and is referenced frequently to this day
https://archive.ics.uci.edu/ml/datasets/Iris

Kaggle Titanic: Machine Learning from Disaster

use machine learning to create a model that predicts which passengers survived the Titanic shipwreck.

https://www.kaggle.com/c/titanic


分享到:


相關文章: