Professional Documents
Culture Documents
MiniCPM-2B: New Compact Multimodal LLM Outperforming The Giants
MiniCPM-2B: New Compact Multimodal LLM Outperforming The Giants
com/
Introduction
What is MiniCPM-2B?
On the other hand, the DPO version is fine-tuned with dynamic prompt
optimization (DPO) on the MTBench dataset, a benchmark that
simulates real-world user scenarios of LLMs. DPO is an innovative
technique that automatically learns the optimal prompts for different
tasks and domains, eliminating the need for human intervention.
MiniCPM-2B has many capabilities and use cases that can benefit
various users and applications. Here are some examples:
Conclusion
Source
Blog:
https://shengdinghu.notion.site/MiniCPM-Unveiling-the-Potential-of-End-side-Large-Language-Models-d4d3a8c426424654a4e8
0e42a711cb20
Github Repo: https://github.com/OpenBMB/MiniCPM/blob/main/README-en.md
Models: https://huggingface.co/openbmb/MiniCPM-V