大模型学习与实践笔记（九）-编程知识

大模型学习与实践笔记（九）

一、LMDeply方式部署

使用 LMDeploy 以本地对话方式部署 InternLM-Chat-7B 模型，生成 300 字的小故事

2.api 方式部署

运行

结果：

显存占用：

二、报错与解决方案

在使用命令，对lmdeploy 进行源码安装是时，报错

1.源码安装语句

pip install 'lmdeploy[all]==v0.1.0'

2.报错语句：

Building wheels for collected packages: flash-attnBuilding wheel for flash-attn (setup.py) ... errorerror: subprocess-exited-with-error× python setup.py bdist_wheel did not run successfully.│ exit code: 1╰─> [9 lines of output]fatal: not a git repository (or any of the parent directories): .gittorch.__version__  = 2.0.1running bdist_wheelGuessing wheel URL:  https://github.com/Dao-AILab/flash-attention/releases/download/v2.4.2/flash_attn-2.4.2+cu118torch2.0cxx11abiFALSE-cp310-cp310-linux_x86_64.whlerror: <urlopen error Tunnel connection failed: 503 Service Unavailable>[end of output]note: This error originates from a subprocess, and is likely not a problem with pip.ERROR: Failed building wheel for flash-attnRunning setup.py clean for flash-attn
Failed to build flash-attn
ERROR: Could not build wheels for flash-attn, which is required to install pyproject.toml-based projects

3.解决方法

（1）在https://github.com/Dao-AILab/flash-attention/releases/ 下载对应版本的安装包

（2）通过pip 进行安装

pip install flash_attn-2.3.5+cu117torch2.0cxx11abiFALSE-cp310-cp310-linux_x86_64.whl

4.参考链接

https://github.com/Dao-AILab/flash-attention/issues/224

本文来自互联网用户投稿，该文观点仅代表作者本人，不代表本站立场。本站仅提供信息存储空间服务，不拥有所有权，不承担相关法律责任。如若转载，请注明出处：http://www.hqwc.cn/news/413728.html

如若内容造成侵权/违法违规/事实不符，请联系编程知识网进行投诉反馈email:809451989@qq.com，一经查实，立即删除！

大模型学习与实践笔记（九）

一、LMDeply方式部署

二、报错与解决方案

1.源码安装语句

2.报错语句：

3.解决方法

4.参考链接

相关文章

一款开源且不限制大小可以设置过期时间的支持分享的的开源文件共享系统picoshare 部署教程

目标检测--01

深度学习和机器学习中针对非时间序列的回归任务，有哪些改进角度？

阿里云ECS(CentOS镜像)安装docker

使用 MinIO 和 PostgreSQL 简化数据事件

基于R语言的NDVI的Sen-MK趋势检验

CSV文件中json列的处理2

手动添加测试用例配置输入参数和期望值

MacOS受欢迎的数据库开发工具 Navicat Premium 15 中文版

算法题-爬楼梯-不同思路解法

【iOS】——基于Vision Kit框架实现图片文字识别

.net core IResultFilter 的 OnResultExecuted和OnResultExecuting的区别