Arm backend: Add Qwen3 VL E2E coverage#20274
Conversation
🔗 Helpful Links🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/20274
Note: Links to docs will display an error until the docs builds have been completed. ❗ 1 Active SEVsThere are 1 currently active SEVs. If your PR is affected, please view them below: ❌ 6 New Failures, 3 Pending, 2 Unrelated Failures, 5 Unclassified FailuresAs of commit 128a4bc with merge base 5563ee9 ( NEW FAILURES - The following jobs have failed:
UNCLASSIFIED FAILURES - DrCI could not classify the following jobs because the workflow did not run on the merge base. The failures may be pre-existing on trunk or introduced by this PR:
FLAKY - The following jobs failed but were likely due to flakiness present on trunk:
This comment was automatically generated by Dr. CI and updates every 15 minutes. |
|
|
|
@pytorchbot label "partner: arm" |
|
@pytorchbot label "release notes: arm" |
|
@claude review this PR |
|
Claude finished @bdemirb's task in 3m 9s —— View job Review: Qwen3 VL E2E coverage
Overall this is a clean, well-structured addition that mirrors the existing 1. Non-deterministic inputs + relaxed tolerances → likely flaky (most important)All weights and inputs are random with no seed set anywhere ( Recommend seeding at the top of each 2. Substantial duplication with
|
Add full Qwen3 VL language and vision model tests for the Arm backend in FP32 and BF16 modes. Cover both TOSA and VGF no-quant paths, with BF16 VGF using an explicit FP profile that advertises BF16 support. Relax the FP32 language-model tolerance to match observed TOSA reference drift for the full decoder stack. Signed-off-by: Baris Demir <baris.demir@arm.com> Change-Id: I1fa6ffde632e0b252c4c19a95a854268ae01ba5a
e851ac7 to
128a4bc
Compare
zingo
left a comment
There was a problem hiding this comment.
OK to merge if tests are OK, just make sure there is no buck2 related changes needed for the added files (you can ask codex or calude about it or review the buck2 config files by hand)
Add full Qwen3 VL language and vision model tests for the Arm backend in FP32 and BF16 modes.
Cover both TOSA and VGF no-quant paths, with BF16 VGF using an explicit FP profile that advertises BF16 support.
Relax the FP32 language-model tolerance to match observed TOSA reference drift for the full decoder stack.
cc @digantdesai @freddan80 @per @zingo @oscarandersson8218 @mansnils @Sebastian-Larsson @robell @rascani