Home for the Development of MLX Vulkan backend
Results from running Qwen3-0.6B on AMD Radeon 8060S (Strix Halo):
Running warmup..
Timing with prompt_tokens=4096, generation_tokens=128, batch_size=1.
Trial 1: prompt_tps=1473.762, generation_tps=11.169, peak_memory=2.062
Averages: prompt_tps=1473.762, generation_tps=11.169, peak_memory=2.062
Benchmarks will be updated automatically by CI.