Hands-On with LLaMA-Factory on Ascend

Posted by yd_294961020 on 2025/06/20 10:23:41

Environment setup (run inside a container that already has CANN installed)

git clone https://github.com/hiyouga/LLaMA-Factory.git
cd LLaMA-Factory
pip install -e ".[torch-npu,metrics]"

# Check the installed versions
llamafactory-cli env
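
Before moving on, it can help to confirm that PyTorch actually sees an NPU. The following is a minimal sketch, assuming the `torch-npu` extra installed the `torch_npu` package; the helper name `describe_npu` is hypothetical and not part of LLaMA-Factory.

```python
import importlib.util


def describe_npu() -> str:
    """Return a short status string about NPU visibility (hypothetical helper)."""
    # Skip the import entirely when either package is missing.
    if importlib.util.find_spec("torch") is None:
        return "torch is not installed"
    if importlib.util.find_spec("torch_npu") is None:
        return "torch_npu is not installed"
    try:
        import torch
        import torch_npu  # noqa: F401  (importing it registers torch.npu)
    except (ImportError, OSError) as exc:
        # Typically means the CANN runtime libraries could not be loaded.
        return f"torch_npu failed to import: {exc}"
    if not torch.npu.is_available():
        return "torch_npu is installed, but no NPU is available"
    return f"{torch.npu.device_count()} NPU device(s) visible"


print(describe_npu())
```

If the last message is not printed inside the container, the commands in the following sections are unlikely to find a device either.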

End-to-end walkthrough

Inference with the original model

cd LLaMA-Factory

# The model weights must be downloaded beforehand, e.g. to /weights/Qwen1.5-0.5B-Chat/
ASCEND_RT_VISIBLE_DEVICES=0 llamafactory-cli chat --model_name_or_path /weights/Qwen1.5-0.5B-Chat/ --template qwen

# Once the model has loaded, you can chat with it interactively

Instruction fine-tuning (SFT) with LoRA

Create a YAML file, e.g. qwen.yaml:

### model
model_name_or_path: /weights/Qwen1.5-0.5B-Chat

### method
stage: sft
do_train: true
finetuning_type: lora
lora_target: q_proj,v_proj

### ddp (enabling DeepSpeed requires running on multiple NPUs)
#ddp_timeout: 180000000
#deepspeed: examples/deepspeed/ds_z0_config.json

### dataset
dataset: identity,alpaca_en_demo
template: qwen
cutoff_len: 1024
max_samples: 1000
overwrite_cache: true
preprocessing_num_workers: 16

### output
output_dir: saves/Qwen1.5-0.5B-Chat/lora/sft
logging_steps: 10
save_steps: 500
plot_loss: true
overwrite_output_dir: true

### train
per_device_train_batch_size: 1
gradient_accumulation_steps: 2
learning_rate: 0.0001
num_train_epochs: 1.0
lr_scheduler_type: cosine
warmup_ratio: 0.1
fp16: true

### eval
val_size: 0.1
per_device_eval_batch_size: 1
#evaluation_strategy: steps
eval_steps: 500
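
To see how the batch settings above translate into a training budget, here is a rough sketch of the arithmetic. The function name `estimate_schedule` and the sample count of 1000 are illustrative assumptions; the real count depends on the datasets after `max_samples` is applied, and the trainer may round slightly differently.

```python
import math


def estimate_schedule(num_samples, val_size=0.1, per_device_batch=1,
                      grad_accum=2, num_devices=1, epochs=1.0,
                      warmup_ratio=0.1):
    """Approximate the optimizer-step budget implied by the YAML settings."""
    # val_size holds out a fraction of the data for evaluation.
    num_val = round(num_samples * val_size)
    num_train = num_samples - num_val
    # Samples consumed per optimizer step across all devices.
    effective_batch = per_device_batch * grad_accum * num_devices
    steps_per_epoch = math.ceil(num_train / effective_batch)
    total_steps = math.ceil(steps_per_epoch * epochs)
    # warmup_ratio is a fraction of the total optimizer steps.
    warmup_steps = math.ceil(total_steps * warmup_ratio)
    return {
        "num_train": num_train,
        "num_val": num_val,
        "effective_batch": effective_batch,
        "total_steps": total_steps,
        "warmup_steps": warmup_steps,
    }


# 1000 is a hypothetical sample count, not a measured dataset size.
schedule = estimate_schedule(num_samples=1000)
print(schedule)
```

Under these assumed numbers the run has about 450 optimizer steps, which is fewer than `save_steps: 500`, so no intermediate checkpoint would be written before the end of training. Lowering `save_steps` is one way to get intermediate checkpoints on a small run like this.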

Run the fine-tuning

cd LLaMA-Factory
ASCEND_RT_VISIBLE_DEVICES=0 llamafactory-cli train qwen.yaml
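
The `ASCEND_RT_VISIBLE_DEVICES` variable limits which NPUs the process can see. When the job is started from a script rather than typed by hand, the same launch can be sketched as below. The helper names `build_launch` and `launch` are hypothetical, and `qwen.yaml` stands for the YAML file created earlier.

```python
import os
import subprocess


def build_launch(config_path, devices):
    """Build the command and environment for a training run (hypothetical helper)."""
    env = dict(os.environ)
    # Comma-separated device ids, e.g. "0" or "0,1".
    env["ASCEND_RT_VISIBLE_DEVICES"] = ",".join(str(d) for d in devices)
    cmd = ["llamafactory-cli", "train", config_path]
    return cmd, env


def launch(config_path, devices):
    """Start training and return the exit code; raises if the command fails."""
    cmd, env = build_launch(config_path, devices)
    return subprocess.run(cmd, env=env, check=True).returncode


# Only build and show the command here; call launch(...) to actually train.
cmd, env = build_launch("qwen.yaml", devices=[0])
print(env["ASCEND_RT_VISIBLE_DEVICES"], " ".join(cmd))
```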

Inference with the LoRA adapter merged dynamically

ASCEND_RT_VISIBLE_DEVICES=0 llamafactory-cli chat --model_name_or_path /weights/Qwen1.5-0.5B-Chat/ --template qwen --adapter_name_or_path saves/Qwen1.5-0.5B-Chat/lora/sft --finetuning_type lora

Benchmark evaluation

llamafactory-cli eval --model_name_or_path /weights/Qwen1.5-0.5B-Chat/ --template fewshot --task mmlu_test --lang en --n_shot 5 --batch_size 1 --trust_remote_code true
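
The `--n_shot 5` option means each test question is preceded by five solved examples. The sketch below illustrates that idea only; the prompt layout and the function name `build_few_shot_prompt` are assumptions for illustration, not LLaMA-Factory's actual evaluation template.

```python
def build_few_shot_prompt(examples, question, choices, n_shot=5):
    """Prepend up to n_shot solved examples to an unanswered question."""
    letters = "ABCD"

    def render(q, opts, answer=None):
        lines = [q] + [f"{letter}. {opt}" for letter, opt in zip(letters, opts)]
        # Solved examples show the gold letter; the final question leaves it open.
        lines.append("Answer:" + (f" {answer}" if answer else ""))
        return "\n".join(lines)

    shots = [render(q, opts, ans) for q, opts, ans in examples[:n_shot]]
    return "\n\n".join(shots + [render(question, choices)])


# Hypothetical solved example and test question.
demo = [("2 + 2 = ?", ["3", "4", "5", "6"], "B")]
prompt = build_few_shot_prompt(demo, "3 + 3 = ?", ["5", "6", "7", "8"], n_shot=1)
print(prompt)
```

The model's reply to the final `Answer:` is then compared with the gold choice letter to compute accuracy.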
