EADST

Train XGBoost Model with Pandas Input

Train XGBoost Model with Pandas Input

import warnings
warnings.filterwarnings("ignore")
import pandas as pd
import numpy as np
import xgboost as xgb
from sklearn.metrics import classification_report

train=pd.read_csv('./train.csv')
test=pd.read_csv('./test.csv')


info=pd.read_csv('info.csv')
print(info.head()) # column name
print(info.shape)
new_info = info.drop_duplicates(subset=['id']) # remove duplicate row with same id
train2=pd.merge(train, new_info[['id', 'number']], how='left', on='id').fillna(0) # merge table horizontally

train_y=train2['result']
train_x=train2.drop(columns=['uaid','result','others'])
test_id = test['id']
test_y=test['result']
test_x=test.drop(columns=['uaid','result','others'])


model = xgb.XGBClassifier()
model.fit(train_x, train_y)
train_predict_y = model.predict(train_x)
print(classification_report(train_y, train_predict_y))


result=model.predict_proba(test_x)
result=pd.concat([test_y,pd.DataFrame(result)],axis=1)
result.to_csv('./test_result.csv')
相关标签
About Me
XD
Goals determine what you are going to be.
Category
标签云
Card YOLO OpenAI DeepSeek Bitcoin Augmentation Breakpoint LoRA LaTeX Shortcut 强化学习 Input Qwen PyTorch QWEN 公式 Pickle Distillation Git Markdown logger VPN NameSilo News PIP Attention Jetson Crawler Qwen2 ChatGPT Jupyter Datetime Vim Github Bert BeautifulSoup Color 版权 签证 InvalidArgumentError Permission Random JSON CEIR llama.cpp RAR CLAP 算法题 Llama Gemma VGG-16 tar XGBoost FP64 HaggingFace 搞笑 Disk 报税 SVR Docker Cloudreve 腾讯云 IndexTTS2 Video Password Claude Streamlit Quantization PDB OpenCV CV LLM Excel uWSGI Logo v2ray API Hotel NLTK Plate Michelin 飞书 Statistics Food Qwen2.5 Interview Bipartite scipy Data Django Review tqdm BTC Mixtral torchinfo MD5 Paddle Nginx git-lfs Windows 财报 TensorRT WAN Conda ModelScope Math CSV SQL OCR TSV Pytorch Linux 云服务器 LLAMA Tiktoken Template Domain DeepStream Image2Text Knowledge transformers Tensor Use SAM Clash COCO XML Dataset 递归学习法 Website VSCode Heatmap Vmess NLP 音频 阿里云 多进程 净利润 Animate Proxy HuggingFace CC FP8 Sklearn hf 顶会 EXCEL diffusers Hilton 图形思考法 Magnet printf LeetCode v0.dev Python Anaconda Bin FP16 GoogLeNet Paper Quantize BF16 GPTQ Ptyhon C++ CTC 域名 Safetensors CUDA ONNX FP32 Plotly TensorFlow Web FastAPI 关于博主 Algorithm Search uwsgi Pandas git Miniforge SQLite GGML SPIE 继承 Hungarian Google UI Firewall Transformers Agent Zip Numpy WebCrawler CAM 多线程 ResNet-50 Translation Pillow AI 第一性原理 mmap Freesound Land Tracking Ubuntu GPT4 PyCharm Diagram Base64 TTS PDF Baidu 证件照 GIT FlashAttention UNIX RGB
站点统计

本站现有博文321篇,共被浏览780045

本站已经建立2472天!

热门文章
文章归档
回到顶部