关于ChatYuan-large对话机器人在小藤上试验的奇妙旅程
【摘要】 关于ChatYuan-large对话机器人在小藤上试验的奇妙旅程
参考 JeffDing的博客:https://bbs.huaweicloud.com/blogs/405188
将ubuntu 22.04 X86虚拟机的内存改成32G:
data:image/s3,"s3://crabby-images/15fc2/15fc2e1d9803162994ef6028abb63302d8bd1949" alt=""
打开虚拟机查看内存使用情况:
data:image/s3,"s3://crabby-images/2beab/2beab64693270d8154810ed25ef5ce9bb0edddcd" alt=""
下载并安装CANN 6.2RC1 X86版本:
wget https://ascend-repo.obs.cn-east-2.myhuaweicloud.com/CANN/CANN%206.2.RC1/Ascend-cann-toolkit_6.2.RC1_linux-x86_64.run
data:image/s3,"s3://crabby-images/ffcbb/ffcbbfd9c67586c1c2002309bf00f049bffdc49d" alt=""
chmod a+x Ascend-cann-toolkit_6.2.RC1_linux-x86_64.run
data:image/s3,"s3://crabby-images/ebe39/ebe3907b1f236cec343c0b387c4e8531f0e7d973" alt=""
报错了,说Python不是3.7-3.9之中的版本。。
仔细检查一下:
data:image/s3,"s3://crabby-images/40446/40446d317e11e6f9cb5954c525c02d8b1cda3f87" alt=""
确实如此。
那么使用conda创建一个Python 3.9的conda环境:
data:image/s3,"s3://crabby-images/03bb8/03bb8d984f572978aa16daee6fba2dcf92e70a18" alt=""
data:image/s3,"s3://crabby-images/9c823/9c823f24fb2ea9b2c07ba96aa230797ada456f6e" alt=""
重新执行CANN的安装:
data:image/s3,"s3://crabby-images/23347/23347be0cd760eaabf24244a3e28b7cae94e9f5e" alt=""
data:image/s3,"s3://crabby-images/875a0/875a08a5f7349140abbb6d5fed06d718d9682fcd" alt=""
根据提示,在~/.bashrc中增加以下行:
data:image/s3,"s3://crabby-images/8f492/8f49290e6f969b91752a29070865422fc5ab9273" alt=""
查找 libascend_hal.so的位置:
data:image/s3,"s3://crabby-images/274b9/274b9b9a2b9cf0d086c9feea670f3c0364908c0d" alt=""
将其路径加入到 LD_LIBRARY_PATH环境变量中:
data:image/s3,"s3://crabby-images/beb56/beb568645ae545600acc56b5616507d391d050cb" alt=""
验证模型转换atc命令是否正常:
data:image/s3,"s3://crabby-images/124b8/124b8157ca7d213f89fe80e05bab9c8010d921a3" alt=""
下载昇腾官方代码:git clone https://gitee.com/HUAWEI-ASCEND/ascend-devkit
data:image/s3,"s3://crabby-images/6b796/6b7965e4a8b07a40bd61ce7a38998a2025dafff7" alt=""
将ChatYuan-large对话机器人项目的ModelConvert目录单独复制出来:
data:image/s3,"s3://crabby-images/395ef/395ef4994f4dce770b5ee8f027977541ed052b40" alt=""
进入该目录,安装相关依赖:
data:image/s3,"s3://crabby-images/b7ff2/b7ff2766f96049bf2e2cee15c4f3562b0b02fdf7" alt=""
data:image/s3,"s3://crabby-images/f5ae0/f5ae028c3d6949b7cf2c3a34fe2a44c05d94f573" alt=""
抱歉,网络环境不好,超时了。
安装时加入清华源,重试:
data:image/s3,"s3://crabby-images/0b06e/0b06edcec574598a6c03bcd5bca2429cbd6061cd" alt=""
data:image/s3,"s3://crabby-images/64278/6427879e0da82c5b37f03a1116a17598ba9da24d" alt=""
data:image/s3,"s3://crabby-images/60361/60361b258bbc0228790c7ecea9582671cb191fb2" alt=""
查看剩余空间:
data:image/s3,"s3://crabby-images/23397/2339740e877f10c315985442b0a3656d96b01627" alt=""
感觉有点不够用了。
虚拟机关机,扩容:
data:image/s3,"s3://crabby-images/14aac/14aac161948e2fc76d730b1444ab421404b842fb" alt=""
data:image/s3,"s3://crabby-images/a321d/a321d27a036f372594390e4ca3fa7cd46cba5397" alt=""
虚拟机开机,安装gparted:
sudo apt-get install gparted
data:image/s3,"s3://crabby-images/70054/70054a5e57d84447e5126ca387395f182b61958f" alt=""
sudo gparted启动分区软件:
data:image/s3,"s3://crabby-images/8139f/8139fa226e75fe377cc5d75ddee677798b02b3cf" alt=""
报错了,无法扩容?
在/dev/sda3上,点击右键,信息:
data:image/s3,"s3://crabby-images/ab340/ab340ed4fd5ff7d4a59b90c0085c20e0311ef44b" alt=""
挂载于 /, /var/snap/firefox/common/host-hunspell
sudo -i
mount -o remount -rw /
mount -o remount -rw /var/snap/firefox/common/host-hunspell
data:image/s3,"s3://crabby-images/20dc1/20dc1da54c4f7235b9b9fc8a6f2a460082a17577" alt=""
重新分区,将滚动条拖到底:
data:image/s3,"s3://crabby-images/8501b/8501bd804ea4cd2dfac93377e62265dd58a8b9cd" alt=""
点击调整大小
data:image/s3,"s3://crabby-images/98d8d/98d8de360ca1c0dedf47f3ccf2fa9c46aed99113" alt=""
再点击上面绿色的钩子:
data:image/s3,"s3://crabby-images/1d4b1/1d4b1018a8c113f2d3ad605ea6e79c1b61cfd439" alt=""
点击应用。
data:image/s3,"s3://crabby-images/07db5/07db5d3384d5c9da7cce7c39b0ba6d2f020b802d" alt=""
关闭窗口,重启虚拟机。
可以看到已经增加了20G空间:
data:image/s3,"s3://crabby-images/1ccfe/1ccfeda47ede464fb8870304db92c5a3fe9a47ac" alt=""
cd ModelConvert
conda activate chatyuan
导出CHatYuan-Large-V2模型
先导出encoder模型:
python export_encoder.py --version=v2
data:image/s3,"s3://crabby-images/cd28a/cd28a5cb78271c35ac07e8e3d5d7b2f7472848b6" alt=""
耐心等待执行完毕:
data:image/s3,"s3://crabby-images/2a091/2a0918f15453bc39a19ff2ed09c0202b2b9f67b8" alt=""
再导出第一次自回归的decoder与lm_head部分模型
python export_decoder_first.py --version=v2
data:image/s3,"s3://crabby-images/4a763/4a76305221c2af418b077e6d263d690cb915e980" alt=""
这里报错了吗?感觉有,又感觉没有。。
看到文件好像也生成了:
data:image/s3,"s3://crabby-images/dc35e/dc35eeacacbb615b0a342edd0b402c7748f5399c" alt=""
最后导出第2-N次自回归的decoder与lm_head部分模型
python export_decoder_iter.py --version=v2
data:image/s3,"s3://crabby-images/34382/34382caccb61dcda1cb062b42653d608ec58801c" alt=""
生成onnx文件如下:
data:image/s3,"s3://crabby-images/3e326/3e326034929d13a29dcc939c18a5ffbdead18370" alt=""
分别对三个模型做裁剪:
onnxsim ./encoder.onnx ./encoder_sim.onnx
data:image/s3,"s3://crabby-images/def29/def2915e768993d61f29cfd7c80723911b263568" alt=""
onnxsim ./decoder_first.onnx ./decoder_first_sim.onnx
data:image/s3,"s3://crabby-images/742be/742be117c05e919b81e5aa2700583df5f3a23f35" alt=""
data:image/s3,"s3://crabby-images/2a550/2a5506f0e33c0c5f4a73e1f1ba810dc38bffbe4e" alt=""
onnxsim ./decoder_iter.onnx ./decoder_iter_sim.onnx
data:image/s3,"s3://crabby-images/82468/82468174d8ae3f99bf13dc3aeece8bd18be2c478" alt=""
这个模型转换时间比较长,我们可以查看top看看是不是在跑:
data:image/s3,"s3://crabby-images/5f1a2/5f1a2a428a8b6d9fbb2410ad263fa74f4bc32325" alt=""
看来是耗CPU比较厉害。下次给虚拟机多分配几个CPU再试试能不能提速。
data:image/s3,"s3://crabby-images/86b80/86b80cfa194bcfbaf5ef62dce2a2cc28a2a99342" alt=""
data:image/s3,"s3://crabby-images/442a5/442a55234bc5f04d9788e6cc61016595c71001ee" alt=""
转换完成。
data:image/s3,"s3://crabby-images/4ec6b/4ec6b1f017b4263101d4f57eeb731430664ab26d" alt=""
结果就是以上三个sim的onnx模型。
对decode部分做int8量化:
python quant_decoder.py
data:image/s3,"s3://crabby-images/227ce/227cecba0284da0ce971c7201c7ca4f4d108a591" alt=""
data:image/s3,"s3://crabby-images/85594/8559468120005e319c117d19b3f5dfc2d5657b2f" alt=""
生成了两个quant的onnx模型:
data:image/s3,"s3://crabby-images/da682/da68270cce62f1cb658107bdaf889e47b8baba3e" alt=""
将裁剪后的encoder模型转为混合精度的om模型:
atc --model=encoder_sim.onnx \ --framework=5 \ --soc_version=Ascend310B1 \ --output=encoder \ --input_format=ND \ --input_shape="input_ids:1,768;attention_mask:1,768" \ --precision_mode=allow_fp32_to_fp16
data:image/s3,"s3://crabby-images/2d60f/2d60f1ea7507350a1775d235e8f02da8a8b79991" alt=""
好像缺包:No module named 'decorator
pip install decorator -i https://pypi.tuna.tsinghua.edu.cn/simple
data:image/s3,"s3://crabby-images/4caf6/4caf68c5676bde51162f8120ab9314a64e241dd6" alt=""
再来:
atc --model=encoder_sim.onnx \ --framework=5 \ --soc_version=Ascend310B1 \ --output=encoder \ --input_format=ND \ --input_shape="input_ids:1,768;attention_mask:1,768" \ --precision_mode=allow_fp32_to_fp16
data:image/s3,"s3://crabby-images/66e09/66e0938a2921c14a3281286e809500575f0c9227" alt=""
唉,一一补齐吧!
pip install scipy -i https://pypi.tuna.tsinghua.edu.cn/simple
data:image/s3,"s3://crabby-images/6efad/6efad30e32441db481b2849d37bc8bf62c61dc86" alt=""
atc --model=encoder_sim.onnx \ --framework=5 \ --soc_version=Ascend310B1 \ --output=encoder \ --input_format=ND \ --input_shape="input_ids:1,768;attention_mask:1,768" \ --precision_mode=allow_fp32_to_fp16
data:image/s3,"s3://crabby-images/8b548/8b548529dac01086c3e7614a93d3d044740b0dc6" alt=""
data:image/s3,"s3://crabby-images/da8f2/da8f25267489578d909c01f108c8a7502cebede0" alt=""
data:image/s3,"s3://crabby-images/4027a/4027a81c6f9f6a380f6a7006d1a7b17d3387ece9" alt=""
data:image/s3,"s3://crabby-images/431b2/431b286417a971f566f9a8cbc475ecc5b30945cc" alt=""
再来:
data:image/s3,"s3://crabby-images/15fe4/15fe4ec9779024fffdd0d428af8c0409977c49a6" alt=""
终于好像没报错了,耐心等待转换结束:
data:image/s3,"s3://crabby-images/1e55a/1e55a7efd200ca72c3082e97691ce6fa4ed2c590" alt=""
在X86上转换确实很快。(得益于张小白的强大PC。。)
data:image/s3,"s3://crabby-images/c3bd6/c3bd6b422a5cf70f40d62fcaf52904c9765517a8" alt=""
在200DK上执行以下动作:
下载官方代码:
git clone https://gitee.com/HUAWEI-ASCEND/ascend-devkit
data:image/s3,"s3://crabby-images/37fed/37fed424b87db61a7185016f479e5ff763bb8623" alt=""
切换到root用户,关闭oom killer:
su -
data:image/s3,"s3://crabby-images/f16d0/f16d078004f99ececb1abcdc95e24c667bb97191" alt=""
vi /etc/rc.local
data:image/s3,"s3://crabby-images/7f0f2/7f0f2eb3b97f0989751fa00882e8f44c0e7aef35" alt=""
注释掉以上2行。
vi /sys/fs/cgroup/memory/usermemory/memory.oom_control
data:image/s3,"s3://crabby-images/1bf8e/1bf8e4f76c6951966fd32ce924f9f1bda05cf534" alt=""
echo 1 > /sys/fs/cgroup/memory/usermemory/memory.oom_control
重新查看:vi /sys/fs/cgroup/memory/usermemory/memory.oom_control
data:image/s3,"s3://crabby-images/58be7/58be79a11fc58f0e67888ff931d4049f23dba8ef" alt=""
可见disable已经变成了1
echo $$ > /sys/fs/cgroup/memory/usermemory/tasks
data:image/s3,"s3://crabby-images/3c7ad/3c7ad3c4806e4fbdae8da08e36e704f16a279451" alt=""
init 6重启开发板
data:image/s3,"s3://crabby-images/64c13/64c13594271c6f20030daf85c19fffd5b6ca2070" alt=""
等重启好后,登录开发板,增加swap分区:
free -h
su - 切换到root
fallocate -l 2.5G /swapfile
chmod 600 /swapfile
mkswap /swapfile
swapon /swapfile
free -h
data:image/s3,"s3://crabby-images/30486/304862a0f055f26ce5c1244fa8aba7a34cd78410" alt=""
这里分区虽然设置了2.5G,但是张小白以前曾经创建过4G的swapfile,所以swap空间还是扩到了4G。
切换到V2model的目录:
cd /home/HwHiAiUser/ascend-devkit/src/E2E-Sample/ChatBot/Demo_V2
pip install -r requirements.txt -i https://pypi.tuna.tsinghua.edu.cn/simple
data:image/s3,"s3://crabby-images/43975/43975690eb7284b8fb21df488084a99b677ff680" alt=""
data:image/s3,"s3://crabby-images/a95a8/a95a88a237f7c6ff56b959b3cfdb4965d3ce4f81" alt=""
分别在PC端浏览器下载以下文件:
data:image/s3,"s3://crabby-images/66d49/66d49cd89718b0c9ff6498a39f0df9a1ef41bddb" alt=""
(model1的暂时不要下载)
将tokenizer_file.zip和 dist_chatbot_standalone.zip文件传到开发板 /home/HwHiAiUser目录下,并使用unzip解压:
data:image/s3,"s3://crabby-images/40d73/40d73f6ab0db8435f81059acb4f6d3719bafbf98" alt=""
将tokenizer_file目录下的文件复制到 /home/HwHiAiUser/ascend-devkit/src/E2E-Sample/ChatBot/Demo_V2/tokenizer_file 目录下:
data:image/s3,"s3://crabby-images/9f406/9f4060f79fbee2dc6bd47f45fce3614a9dd260ea" alt=""
将dist_chatbot_standalone目录下的文件复制到/home/HwHiAiUser/ascend-devkit/src/E2E-Sample/ChatBot/Demo_V2/dist_chatbot_standalone 目录下:
data:image/s3,"s3://crabby-images/b1adb/b1adb3e53e849e18fdc6fcd0103a7d9d2ed1059b" alt=""
将前面转换好的模型 encoder.om(790M)、decoder_iter_sim_quant.onnx(425M)、decoder_first_sim_quant.onnx(475M) 传输到/home/HwHiAiUser/ascend-devkit/src/E2E-Sample/ChatBot/Demo_V2/models目录下:
data:image/s3,"s3://crabby-images/0a7c8/0a7c8419830ef646db9739e15bd2d17fcf548bde" alt=""
(可以使用MobaXterm先Download到windows,再上传到开发板,或者直接scp过去)
data:image/s3,"s3://crabby-images/ba6db/ba6db5538a4c440e0ac7f562fa9e81ff994ba042" alt=""
data:image/s3,"s3://crabby-images/e53a5/e53a5124b69236beb03179a042794d0fa95ca16d" alt=""
data:image/s3,"s3://crabby-images/36751/36751513ccc1ee15a791a43041c14796c9f0a5dc" alt=""
data:image/s3,"s3://crabby-images/0e150/0e15018026bc5f172e787668cf574e30a7354663" alt=""
耐心等待传输完毕:
data:image/s3,"s3://crabby-images/b6381/b6381647a3d27728446c2a83644da07aef5788d9" alt=""
在开发板上,使用HwHiAiUser用户登录
cd /home/HwHiAiUser/ascend-devkit/src/E2E-Sample/ChatBot/Demo_V2
执行python main.py
data:image/s3,"s3://crabby-images/70a08/70a08b726c13038c100e9dcb8738570184eb9680" alt=""
还是换成root登录吧:
data:image/s3,"s3://crabby-images/50857/5085798d3779ad0239c3a427d6f3299615d1bccd" alt=""
data:image/s3,"s3://crabby-images/d925f/d925ff6495fcc6cd76ec6804d5633b4325f30f00" alt=""
输入对话:
写一首情诗
data:image/s3,"s3://crabby-images/e4445/e4445f2fa3c46b7b39cbadb53f24a8ba1468e3d6" alt=""
后台在不停地跳出数据:
data:image/s3,"s3://crabby-images/dcb96/dcb9669da7d8272afff12ecaf18ff814988c44f4" alt=""
但是页面上好像还是没有什么反应。。。
查看top,CPU好像在不停地计算:
data:image/s3,"s3://crabby-images/c128c/c128cffb1505dec9334888dadd286ea404e51490" alt=""
会不会卡死了。。张小白偷偷地想。。。
张小白在纠结是耐心等待呢?还是关掉浏览器重新打开试试看看。。
关掉吧!
重新打开窗口:
data:image/s3,"s3://crabby-images/f1e71/f1e71ba17c2014afad1e87cef7e1b4ce04804955" alt=""
终端显示ChatBot Ready了。
data:image/s3,"s3://crabby-images/93476/934762034d8447dcb9cb0fca6ec5ac60afa3ce0a" alt=""
data:image/s3,"s3://crabby-images/46c11/46c11c9d1ed54bcf8901c2b41c60b16f2150acfd" alt=""
唉,ChatBot在回答第一个问题。。。它在写情诗。。
data:image/s3,"s3://crabby-images/11c79/11c79b69dcbbe27da2a7c7b953c186a31b17b4d9" alt=""
这首诗写得嘛。。。。。真不怎么样。。
data:image/s3,"s3://crabby-images/aef6c/aef6c2428c5bbbbead394f407479e6328e293d7e" alt=""
但是好在它工作了!
再提个问题:
data:image/s3,"s3://crabby-images/5585c/5585c82e6b3343edc062938ca6044a5586b2513e" alt=""
完了,感觉ChatBoT有点崩溃了。
而我是玩坏了它的那个人。。。。
data:image/s3,"s3://crabby-images/122c2/122c2d8dda3903215fea3a1311b3c4fb242fb18e" alt=""
我觉得还是把后台重启一下吧!
python main.py
还是等到ChatBoT Ready再提问吧!
data:image/s3,"s3://crabby-images/87a48/87a48320b8b2b3b270fdeb68a653e54f0447011f" alt=""
张小白大概理解了,就是在ChatBot没有正式启动之前,过早地打开了浏览器跟开发板服务进行了交互,结果屏幕上全是交互的内容,把ChatBot启动的部分给扫过去了。。。
但这个启动确实有点慢,比FastChat要slow一点。。。
data:image/s3,"s3://crabby-images/5e48b/5e48b2c86af8ab8a9d2ad9289db4b999bf0f9656" alt=""
data:image/s3,"s3://crabby-images/71a3b/71a3b4cdee879fea7ada95fba16c66b765b7c692" alt=""
好了,Ready了。提个问题吧!
data:image/s3,"s3://crabby-images/477ba/477baf0d1f431ca7a500ecced6995c6411ae1f66" alt=""
data:image/s3,"s3://crabby-images/d23aa/d23aa152b668334c56de69a4e0a8a47e899d6aac" alt=""
果然后台开始刷屏了。但是页面还没有回复。。
耐心等待吧!
开始回答了:
data:image/s3,"s3://crabby-images/ca506/ca506e00fb3bd9740d0612895c8433ffcf5ab053" alt=""
但是,为啥它还是有点抽风呢。。。
data:image/s3,"s3://crabby-images/2260f/2260fa2145227096bbd7a1f0532e6e0899782335" alt=""
data:image/s3,"s3://crabby-images/f7635/f7635f60aba74c157f18475ea820cd760b3fbb88" alt=""
总算看到篇像样的文字了。
真难为ChatBot了。
补充一句:由于这个例子是在E2E的目录下的,所以张小白使用的是E2E的镜像。
(全文完,谢谢阅读)
【声明】本内容来自华为云开发者社区博主,不代表华为云及华为云开发者社区的观点和立场。转载时必须标注文章的来源(华为云社区)、文章链接、文章作者等基本信息,否则作者和本社区有权追究责任。如果您发现本社区中有涉嫌抄袭的内容,欢迎发送邮件进行举报,并提供相关证据,一经查实,本社区将立刻删除涉嫌侵权内容,举报邮箱:
cloudbbs@huaweicloud.com
- 点赞
- 收藏
- 关注作者
作者其他文章
评论(0)