EADST

Train XGBoost Model with Pandas Input

Train XGBoost Model with Pandas Input

import warnings
warnings.filterwarnings("ignore")
import pandas as pd
import numpy as np
import xgboost as xgb
from sklearn.metrics import classification_report

train=pd.read_csv('./train.csv')
test=pd.read_csv('./test.csv')


info=pd.read_csv('info.csv')
print(info.head()) # column name
print(info.shape)
new_info = info.drop_duplicates(subset=['id']) # remove duplicate row with same id
train2=pd.merge(train, new_info[['id', 'number']], how='left', on='id').fillna(0) # merge table horizontally

train_y=train2['result']
train_x=train2.drop(columns=['uaid','result','others'])
test_id = test['id']
test_y=test['result']
test_x=test.drop(columns=['uaid','result','others'])


model = xgb.XGBClassifier()
model.fit(train_x, train_y)
train_predict_y = model.predict(train_x)
print(classification_report(train_y, train_predict_y))


result=model.predict_proba(test_x)
result=pd.concat([test_y,pd.DataFrame(result)],axis=1)
result.to_csv('./test_result.csv')
相关标签
About Me
XD
Goals determine what you are going to be.
Category
标签云
财报 API Mixtral Diagram 净利润 Web PIP Pillow SQLite 云服务器 Paddle Image2Text Excel Search 公式 Land COCO torchinfo Hotel NameSilo PDB GPT4 GGML BeautifulSoup FP16 News mmap Qwen Color transformers CC Claude Distillation Clash LLM diffusers Vim UI Conda PyCharm Cloudreve v0.dev 飞书 签证 Card OpenAI Windows Translation 报税 uWSGI 算法题 Statistics NLTK Heatmap Disk ModelScope Django QWEN Bin Jetson CUDA 强化学习 IndexTTS2 Safetensors Dataset Math VPN Paper CV NLP Tiktoken Pickle 域名 BTC Review Template Git Plate Food 版权 Baidu Breakpoint Streamlit Michelin Augmentation 多线程 Quantize LLAMA Tracking Docker Video logger Github Linux EXCEL InvalidArgumentError ChatGPT AI Google tqdm Hilton Shortcut HuggingFace git-lfs Random HaggingFace Data TTS Llama CLAP TensorFlow Bitcoin FastAPI RGB 第一性原理 VGG-16 icon JSON Input Permission 腾讯云 v2ray tar UNIX Bipartite 顶会 Website LaTeX TensorRT BF16 Miniforge Domain Logo Qwen2.5 Zip Rebuttal CAM OCR MD5 Pandas uwsgi SPIE Nginx ONNX Transformers Animate git SQL WAN Algorithm Agent CEIR FP64 继承 Numpy 图形思考法 Vmess Knowledge Freesound Ptyhon Tensor LoRA DeepSeek Qwen2 关于博主 GIT VSCode XML hf Attention Proxy SAM 图标 CTC Pytorch RAR PyTorch Crawler SVR Gemma FlashAttention Interview LeetCode Firewall Base64 FP8 Password PDF OpenCV Datetime scipy YOLO Magnet FP32 证件照 Sklearn Use 多进程 WebCrawler C++ 音频 DeepStream XGBoost 递归学习法 printf Anaconda Jupyter 搞笑 GoogLeNet llama.cpp 阿里云 CSV Markdown Plotly TSV Ubuntu Python Hungarian Bert ResNet-50 Quantization GPTQ
站点统计

本站现有博文324篇,共被浏览809112

本站已经建立2512天!

热门文章
文章归档
回到顶部