Skip to main content

Optimize and benchmark

Edge Studio's core loop: analyze a model, apply optimization, benchmark the result, repeat until the candidate fits your target device.

Analysis

Before optimizing, understand the model:

ToolWhat it showsWhen to use
Architecture browserLayer tree, parameter counts, typesFirst load — understand structure
Weight analysisTensor sizes, data types, distributionsIdentify what dominates memory
Activation heatmapLayer-level activation magnitudesAfter profiling — find hotspots
Attention patternsHead importance, attention tracesDebug generation quality
KV cache analysisMemory projections for conversation lengthPlan multi-turn memory budget
MoE analyzerExpert routing and utilizationMoE models only
Inference tracerToken probabilities, step timingDebug specific outputs
Model comparisonSide-by-side original vs optimizedAfter optimization

Optimization

ToolWhat it doesBest for
Auto optimizerSearches candidates automaticallyQuick start — let Studio decide
Optimization pipelineStep-by-step manual controlPrecise tuning of each stage
Pruning simulatorPreview size reduction before applyingEstimating if a target is realistic
Mixed precisionPer-layer quantization bit-widthFine-grained quality/size balance
DistillationTrain a smaller student from a teacherCreating compact models
MergeCombine compatible model sourcesAssembling from multiple sources
Auto tuneSearch inference parametersFinding optimal device config

Quality validation

After every optimization pass:

  1. Run the Quality validator — perplexity check, full report, or custom prompts.
  2. Compare with the original in Model comparison.
  3. Test in Chat with your real use-case prompts.
  4. Do not export until validation passes.

Batch operations

Benchmark dashboard — Run benchmarks across multiple models. Plotly charts, CSV export, side-by-side comparison.

Batch operations — Queue multiple models for optimization. Progress tracking, failure post-mortem.

Use batch tools when evaluating a model catalog or running regression checks across candidates.

Next steps