EADST

Train XGBoost Model with Pandas Input

Train XGBoost Model with Pandas Input

import warnings
warnings.filterwarnings("ignore")
import pandas as pd
import numpy as np
import xgboost as xgb
from sklearn.metrics import classification_report

train=pd.read_csv('./train.csv')
test=pd.read_csv('./test.csv')


info=pd.read_csv('info.csv')
print(info.head()) # column name
print(info.shape)
new_info = info.drop_duplicates(subset=['id']) # remove duplicate row with same id
train2=pd.merge(train, new_info[['id', 'number']], how='left', on='id').fillna(0) # merge table horizontally

train_y=train2['result']
train_x=train2.drop(columns=['uaid','result','others'])
test_id = test['id']
test_y=test['result']
test_x=test.drop(columns=['uaid','result','others'])


model = xgb.XGBClassifier()
model.fit(train_x, train_y)
train_predict_y = model.predict(train_x)
print(classification_report(train_y, train_predict_y))


result=model.predict_proba(test_x)
result=pd.concat([test_y,pd.DataFrame(result)],axis=1)
result.to_csv('./test_result.csv')
相关标签
About Me
XD
Goals determine what you are going to be.
Category
标签云
Bipartite Image2Text LaTeX git-lfs Land Qwen2.5 Qwen2 第一性原理 TSV OCR Diagram CEIR printf Sklearn Shortcut API CC Augmentation Interview Attention PDF Review ChatGPT Proxy Search Plotly Use 搞笑 OpenCV 多线程 Baidu Hotel Data Tensor DeepStream FlashAttention Bin Color Transformers TensorRT InvalidArgumentError Anaconda logger EXCEL Heatmap Rebuttal Excel LoRA Freesound 证件照 Permission Math 公式 顶会 Ubuntu Michelin Food Dataset XGBoost icon Base64 VGG-16 GIT Paddle BeautifulSoup Miniforge GGML RAR Bitcoin IndexTTS2 飞书 Plate Cloudreve PyCharm BF16 UNIX Vim GPTQ CTC 报税 torchinfo 音频 Mixtral Tracking GPT4 Animate Paper 腾讯云 JSON Django Windows NameSilo CUDA Website Jupyter Claude Password Domain Algorithm Bert LeetCode BTC ResNet-50 Video Template Markdown 强化学习 Pytorch WebCrawler SVR LLAMA uwsgi COCO llama.cpp 域名 TTS LLM 论文 RGB uWSGI SAM Disk YOLO FP16 HuggingFace Magnet Quantize PyTorch Llama FP64 Hungarian Quantization v0.dev CAM UI transformers git Input MD5 HaggingFace Web v2ray VSCode Logo scipy 继承 CSV Git 关于博主 Nginx Pickle FP32 论文速读 Linux Pillow 算法题 OpenAI Github Pandas Zip Hilton FP8 C++ 版权 WAN NLP Ptyhon Docker hf News GoogLeNet 递归学习法 SQL Vmess QWEN 签证 Tiktoken Python PIP 图标 阿里云 Distillation Clash 图形思考法 Google Translation Conda mmap 净利润 FastAPI Firewall SPIE PDB 多进程 Streamlit Numpy Crawler tqdm Agent CLAP NLTK 云服务器 Datetime Qwen Gemma XML VPN SQLite diffusers ONNX Breakpoint CV DeepSeek tar AI TensorFlow Knowledge Statistics Jetson Safetensors Random Card 财报 ModelScope
站点统计

本站现有博文327篇,共被浏览837081

本站已经建立2541天!

热门文章
文章归档
回到顶部