v0.27.2:0 - vision check tool + mark Qwen3.6 vision-capable

Qwen3.6-35B-A3B is multimodal (vision tower on disk) but was labelled
text-only. Mark it [vision, reasoning] and add a 'Vision check' button on
the running vision-capable card: upload an image + prompt -> existing /v1
passthrough proxy -> show the model's text. Confirmed 7/7 fields on a
business card. Records the Gemma-4-26B deferral + research findings.
This commit is contained in:
Keysat
2026-06-18 18:14:30 -05:00
parent c846386c1a
commit 9a3bf9ed86
6 changed files with 120 additions and 5 deletions
+4 -1
View File
@@ -96,7 +96,10 @@ models:
repo: RedHatAI/Qwen3.6-35B-A3B-NVFP4
size_gb: 20
mode: solo
capabilities: [reasoning]
# Qwen3.6-35B-A3B is natively multimodal (Qwen3_5MoeForConditionalGeneration,
# vision tower ships in the checkpoint). Confirmed reading a business card
# cleanly on this cluster — use the "Vision check" button on the live card.
capabilities: [vision, reasoning]
expected_ready_seconds: 300
vllm_args:
- --gpu-memory-utilization=0.85