We added a Specification section to every model page on ParaPulse

by

When you're evaluating a model on HuggingFace, you usually have to dig into the model card, check the repo files, or run the model locally just to find out basic architecture details — how many layers, what's the hidden size, what precision it runs at.

We just shipped a fix for that on ParaPulse.

What's new

Model detail pages now include a Specification section that surfaces the technical parameters directly — no repo digging required. If a model has configuration data available, you'll see things like:

- Architecture type and class

- Hidden size, layer count, intermediate (FFN) size

- Attention heads (Q and KV separately, so you can spot GQA/MQA at a glance)

- Max sequence length, vocabulary size

- Torch dtype (bfloat16 / float16 / float32)

- Activation function, RoPE theta, normalization epsilon

- MoE parameters (total experts, active experts per token) for models like Qwen3-MoE

It handles the full range of HuggingFace model types — transformers LLMs, BERT-style encoders, Whisper/audio models, timm vision models, CLIP, and ResNet-style CNNs — not just the big LLMs.

Why it matters

When you're choosing between five similar models, knowing that one uses GQA with 8 KV heads vs full MHA, or that one caps out at 32K tokens vs 128K, changes the decision. That information was always in the repo — it just wasn't surfaced anywhere useful.

The model list also now shows a small indicator icon next to models that have spec data available, so you know before you click.

Would love feedback on what other parameters would be useful to surface. There's a lot more in a typical config file than we show by default.

6 views

Add a comment

Replies

Be the first to comment