Qwen3.6 35B A3B

Qwen3.6 35B A3B inference overview

Price per 1M tokens

$0.25 (input)

$1.25 (output)

Parameters

3B (active)

35B (total)

Context Window

262K

Release Date

Apr 2026

Qwen3.6 35B A3B inference details

Qwen3.6-35B-A3B is a 35B-parameter MoE multimodal model with 3B active parameters and a 262K native context window. As the MoE variant in the Qwen3.6 family, its architecture makes it ideal for efficient agentic coding and high-volume workflows.

Created by:

Alibaba

License:

apache-2.0

Model card:

Qwen3.6-35B-A3B

				
					import openai
import weave

# Weave autopatches OpenAI to log LLM calls to W&B
weave.init("<team>/<project>")

client = openai.OpenAI(
    # The custom base URL points to W&B Inference
    base_url='https://api.inference.wandb.ai/v1',

    # Get your API key from https://wandb.ai/authorize
    # Consider setting it in the environment as OPENAI_API_KEY instead for safety
    api_key="<your-apikey>",

    # Optional: Team and project for usage tracking
    project="<team>/<project>",
)

response = client.chat.completions.create(
    model="Qwen/Qwen3.6-35B-A3B",
    messages=[
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "Tell me a joke."}
    ],
)

print(response.choices[0].message.content)

Qwen3.6 35B A3B resources

Guide

Running Qwen3 Coder on W&B Inference

Course

AI engineering course: Agents

Guide

Qwen3.6 35B A3B inference overview

Price per 1M tokens

Parameters

Context Window

Release Date

Qwen3.6 35B A3B inference details

Qwen3.6 35B A3B resources

The Platform

Article

Resources

Company

Use cases

Industries

Learn more

Qwen3.6 35B A3B inference overview

Price per 1M tokens

Parameters

Context Window

Release Date

Qwen3.6 35B A3B inference details

Qwen3.6 35B A3B resources

The Platform

Article

Resources

Company

Use cases

Industries