Mini-Omni：语言模型可以在流式传输中聆听、交谈和思考 4.38MB

fenglingguitar

资源文件列表:

mini-omni.zip 大约有80个文件

mini-omni/
mini-omni/tokenizer_config.json 1.26KB
mini-omni/lit_model.pth 135B
mini-omni/tokenizer.json 6.7MB
mini-omni/README.md 1.09KB
mini-omni/.gitattributes 1.48KB
mini-omni/model_config.yaml 848B
mini-omni/.git/
mini-omni/frameworkv3.jpg 167.82KB
mini-omni/.git/config 307B
mini-omni/.git/objects/
mini-omni/.git/HEAD 21B
mini-omni/.git/info/
mini-omni/.git/logs/
mini-omni/.git/description 73B
mini-omni/.git/hooks/
mini-omni/.git/refs/
mini-omni/.git/index 625B
mini-omni/.git/packed-refs 112B
mini-omni/.git/objects/95/
mini-omni/.git/objects/68/
mini-omni/.git/objects/3b/
mini-omni/.git/objects/33/
mini-omni/.git/objects/b3/
mini-omni/.git/objects/da/
mini-omni/.git/objects/f4/
mini-omni/.git/objects/eb/
mini-omni/.git/objects/pack/
mini-omni/.git/objects/7b/
mini-omni/.git/objects/2f/
mini-omni/.git/objects/6e/
mini-omni/.git/objects/info/
mini-omni/.git/objects/98/
mini-omni/.git/objects/a0/
mini-omni/.git/objects/a7/
mini-omni/.git/objects/a6/
mini-omni/.git/objects/46/
mini-omni/.git/info/exclude 240B
mini-omni/.git/logs/HEAD 189B
mini-omni/.git/logs/refs/
mini-omni/.git/hooks/commit-msg.sample 896B
mini-omni/.git/hooks/pre-rebase.sample 4.78KB
mini-omni/.git/hooks/pre-commit.sample 1.6KB
mini-omni/.git/hooks/applypatch-msg.sample 478B
mini-omni/.git/hooks/fsmonitor-watchman.sample 4.62KB
mini-omni/.git/hooks/pre-receive.sample 544B
mini-omni/.git/hooks/prepare-commit-msg.sample 1.46KB
mini-omni/.git/hooks/post-update.sample 189B
mini-omni/.git/hooks/pre-merge-commit.sample 416B
mini-omni/.git/hooks/pre-applypatch.sample 424B
mini-omni/.git/hooks/pre-push.sample 1.34KB
mini-omni/.git/hooks/update.sample 3.56KB
mini-omni/.git/hooks/push-to-checkout.sample 2.72KB
mini-omni/.git/refs/heads/
mini-omni/.git/refs/tags/
mini-omni/.git/refs/remotes/
mini-omni/.git/objects/95/341e1d6ae4b5086e60f09e98f1a4ef42aca7fa 91B
mini-omni/.git/objects/68/f1da89b775caff935efc24d5241bd10eeb677c 126B
mini-omni/.git/objects/3b/7ba72c19d27aac21f7967459a282f92659d48b 273B
mini-omni/.git/objects/33/ea6c72ebb92a237fa2bdf26c5ff16592efcdae 2.2MB
mini-omni/.git/objects/b3/a695259f376d4eaf2e78d8d995ddf1fea57736 857B
mini-omni/.git/objects/da/6c66ccafbb524fe0d3f046053124239d67c75c 475B
mini-omni/.git/objects/f4/b55f917af273d0dc98b67ec249f6445dd385f5 489B
mini-omni/.git/objects/eb/09342a1d90c60ff41325e0030971ff25e9fecd 652B
mini-omni/.git/objects/7b/e5fc7f47d5db027d120b8024982df93db95b74 37B
mini-omni/.git/objects/2f/9d8d44c202347e18efe7ed842959c1f5b3b6f6 137.38KB
mini-omni/.git/objects/6e/a96fe583cf6200c1133368ad38011aac72eeeb 810B
mini-omni/.git/objects/98/96323e09177b32edcceedd565a086a91d029ab 852B
mini-omni/.git/objects/a0/dec2a89823c6136f481649b303d5905a31f866 234B
mini-omni/.git/objects/a7/22089c58869095607cb52d19b2f5a0c82cfe15 850B
mini-omni/.git/objects/a6/344aac8c09253b3b630fb776ae94478aa0275b 224B
mini-omni/.git/objects/46/a665ebf65e1fb9b3ea646bfd6dc20618137d5c 234B
mini-omni/.git/logs/refs/heads/
mini-omni/.git/logs/refs/remotes/
mini-omni/.git/refs/heads/main 41B
mini-omni/.git/refs/remotes/origin/
mini-omni/.git/logs/refs/heads/main 189B
mini-omni/.git/logs/refs/remotes/origin/
mini-omni/.git/refs/remotes/origin/HEAD 30B
mini-omni/.git/logs/refs/remotes/origin/HEAD 189B

资源介绍:

Mini-Omni 是一个开源多模型大型语言模型，可以一边听、一边说，一边思考。具有实时端到端语音输入和流音频输出对话功能。特征实时语音对话功能。无需额外的 ASR 或 TTS 模型。一边说话一边思考，能够同时生成文本和音频。流音频输出功能。通过“音频到文本”和“音频到音频”批量推理进一步提升性能。

--- license: mit language: - en base_model: Qwen/Qwen2-0.5B --- Mini-Omni: Language Models Can Hear, Talk While Thinking in Streaming ð¤ <a href="">Hugging Face</a> | ð <a href="https://github.com/gpt-omni/mini-omni">Github</a> | ð <a href="https://arxiv.org/abs/2408.16725">Technical report</a> Mini-Omni is an open-source multimodel large language model that can **hear, talk while thinking**. Featuring real-time end-to-end speech input and **streaming audio output** conversational capabilities. <img src="frameworkv3.jpg" width="100%"/> ## Features â **Real-time speech-to-speech** conversational capabilities. No extra ASR or TTS models required. â **Talking while thinking**, with the ability to generate text and audio at the same time. â **Streaming audio outupt** capabilities. â With "Audio-to-Text" and "Audio-to-Audio" **batch inference** to further boost the performance. **NOTE**: please refer to https://github.com/gpt-omni/mini-omni for more details.

标题	大小	时间
dlib-19.17.0-cp37-cp37m-win-amd64.whl	2.95MB	7月前

撮合交易的购售双方申报信息及节点通道信息	128.22KB	7月前
算法-动态规划-斐波那契模型	6.26MB	7月前
111	429.06KB	7月前
单片机基础仿真与创客拓展99.zip	14.5MB	7月前

2021_9_2Qt-main.zip	3.22MB	7月前
Android阶段性学习成果	32.65MB	7月前
人脸识别模型,解压后放置到工程public文件夹下	10.33MB	7月前