Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

支持qwen吗 #6027

Open
Storm0921 opened this issue Aug 22, 2024 · 3 comments
Open

支持qwen吗 #6027

Storm0921 opened this issue Aug 22, 2024 · 3 comments

Comments

@Storm0921
Copy link

No description provided.

@Issues-translate-bot
Copy link

Bot detected the issue body's language is not English, translate it automatically. 👯👭🏻🧑‍🤝‍🧑👫🧑🏿‍🤝‍🧑🏻👩🏾‍🤝‍👨🏿👬🏿


Title: Support qwenuo

@Storm0921
Copy link
Author

image
image
https://github.com/hpcaitech/ColossalAI/blob/main/applications/ColossalChat/examples/README.md

1.如果支持qwen的话应该怎么使用呢?使用ColossalChat去sft,rm,ppo?好像没看到支持pt?
2.Colossal-LLaMA这块是仅支持llama系列的pt和sft嘛?qwen这种和llama结构基本一致的不能套用进来?
3.coati里看上去很多脚本,能拿来做训练吗?是干啥用的

@Storm0921 Storm0921 changed the title 支持qwe糯 支持qwen吗 Aug 22, 2024
@wangbluo
Copy link
Contributor

wangbluo commented Aug 26, 2024

Hi, Colossal-LLama is not for qwen model, as they have different prompts.
You can use ColossalChat to do sft,rm,ppo but pt.
If your gpu resources is limit, we recommend you to use lora strategies.
Coati is for sft, and rlhf.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants