InternVL3 is an advanced series of multimodal large language models (MLLMs) that demonstrates outstanding overall performance. Compared with InternVL 2.5, InternVL3 delivers superior multimodal perception and reasoning capabilities, while further expanding its multimodal abilities to cover areas such as tool usage, GUI agents, industrial image analysis, and 3D visual perception.
The base model provides a context window of 1024, with a maximum output of 1280 tokens.
Supported platforms: LLM630 Compute Kit, Module LLM, and Module LLM Kit
apt install llm-model-internvl3-1b-448-ax630c The base model provides a context window of 2048, with a maximum output of 2048 tokens.
Supported platform: AI Pyramid
apt install llm-model-internvl3-1b-448-ax630c