GPU Python packages
that just install.

Pre-built CUDA wheels for flash-attn, xformers, vllm, deepspeed, and 9 more packages.
One pip command. No compiling.

bash
$ pip install flash-attn \
--extra-index-url https://ew_xxx:@easywheels.io/simple/ \
--prefer-binary
Collecting flash-attn
Downloading flash_attn-2.7.0-cp312-cu124-linux_x86_64.whl (48.2 MB)
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 48.2/48.2 MB 3.2s
Successfully installed flash-attn-2.7.0
Get Your Free API Key

14-day free trial. No credit card required.

flash-attn flash-attn-3 xformers vllm deepspeed mamba-ssm causal-conv1d llama-cpp-python exllamav2 gptqmodel torchao sageattention flashinfer flash-attn flash-attn-3 xformers vllm deepspeed mamba-ssm causal-conv1d llama-cpp-python exllamav2 gptqmodel torchao sageattention flashinfer

You know the drill.

45+ minutes

Average compile time for flash-attn from source. Per environment. Per CUDA version.

CUDA mismatch

Wrong toolkit version? Incompatible driver? Start over from scratch.

cmake errors

Missing headers, broken builds, cryptic C++ template errors three screens long.

45 min

Build from source

9 seconds

EasyWheels

flash-attn 2.8.3 on Python 3.12. Measured on a real install.

Every ML developer has lost an afternoon to this. We built EasyWheels so you never have to again.

Built by ML engineers,
for ML engineers.

01

Add our index

$ pip install flash-attn \
--extra-index-url https://YOUR_KEY:@easywheels.io/simple/ \
--prefer-binary

One-time setup. Works with any pip workflow.

02

Install as usual

$ pip install flash-attn

We auto-detect your Python version, CUDA toolkit, and GPU architecture.

03

Done in seconds

flash-attn 2.7.0 installed (3.2s)

Pre-compiled wheel. No cmake. No waiting.

Start free. Scale when you're ready.

14-day trial. 5 downloads + 1 custom build. No credit card required.

Early supporter rates — locked in for founding members. Prices will increase as the registry grows.

Trial

$0 / 14 days
  • Full registry access
  • 5 downloads included
  • 1 custom build included
Start Free Trial

Lite

$9 /mo
  • 15 downloads / month
  • Registry access
  • Email support
Subscribe
RECOMMENDED

Pro

$19 /mo
  • Unlimited downloads
  • 3 custom builds / month
  • Priority support
Subscribe

Team

$49 /mo
  • Unlimited downloads
  • 10 custom builds / month
  • 5 team seats included
  • Dedicated support
Subscribe

Need more? Enterprise plans start at $199/mo. Contact us →

Or pay per download — $2/wheel, no subscription needed.

Common questions.