Skip to content

Metal CI: Add Qwen 3.5 MoE tiny model integration test#19042

Merged
manuelcandales merged 1 commit intomainfrom
manuel/metal-ci-qwen-moe-tiny
Apr 22, 2026
Merged

Metal CI: Add Qwen 3.5 MoE tiny model integration test#19042
manuelcandales merged 1 commit intomainfrom
manuel/metal-ci-qwen-moe-tiny

Conversation

@manuelcandales
Copy link
Copy Markdown
Contributor

Export the tiny model with --backend metal, build the C++ runner, and verify both decode (T=1) and prefill (T>2) complete successfully. Uses a byte-level tokenizer matching the tiny model's vocab_size=256.

Authored with Claude.

Copilot AI review requested due to automatic review settings April 22, 2026 03:29
@pytorch-bot
Copy link
Copy Markdown

pytorch-bot Bot commented Apr 22, 2026

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/19042

Note: Links to docs will display an error until the docs builds have been completed.

❗ 1 Active SEVs

There are 1 currently active SEVs. If your PR is affected, please view them below:

❌ 3 New Failures, 1 Cancelled Job, 6 Pending, 4 Unrelated Failures

As of commit 2330a5a with merge base 6be4fb5 (image):

NEW FAILURES - The following jobs have failed:

CANCELLED JOB - The following job was cancelled. Please retry:

FLAKY - The following job failed but was likely due to flakiness present on trunk:

BROKEN TRUNK - The following jobs failed but was present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@meta-cla meta-cla Bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Apr 22, 2026
@github-actions
Copy link
Copy Markdown

This PR needs a release notes: label

If your change should be included in the release notes (i.e. would users of this library care about this change?), please use a label starting with release notes:. This helps us keep track and include your important work in the next release notes.

To add a label, you can comment to pytorchbot, for example
@pytorchbot label "release notes: none"

For more information, see
https://github.com/pytorch/pytorch/wiki/PyTorch-AutoLabel-Bot#why-categorize-for-release-notes-and-how-does-it-work.

Copy link
Copy Markdown
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

Adds a new Metal CI integration test job that exports and runs the Qwen 3.5 MoE tiny model on macOS Metal, aiming to validate both prefill and decode execution paths.

Changes:

  • Extend Metal workflow PR path filters to include the Qwen3.5 MoE example and LLM export code.
  • Add a new test-metal-qwen35-moe-tiny job that installs ExecuTorch, exports a tiny Qwen3.5 MoE model for Metal, builds the runner, generates a minimal tokenizer, and runs two inference scenarios.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Comment thread .github/workflows/metal.yml Outdated
Comment thread .github/workflows/metal.yml Outdated
Comment thread .github/workflows/metal.yml
Export the tiny model with --backend metal, build the C++ runner,
and verify both decode (T=1) and prefill (T>2) complete successfully.
Uses a byte-level tokenizer matching the tiny model's vocab_size=256.

Authored with Claude.
@manuelcandales manuelcandales merged commit 84e1aed into main Apr 22, 2026
462 of 481 checks passed
@manuelcandales manuelcandales deleted the manuel/metal-ci-qwen-moe-tiny branch April 22, 2026 20:36
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

ci/pytorch-pin-bump ciflow/trunk CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants