Weights are the millions or billions of adjustable numbers a model learns during training; they encode everything it knows. Parameter count is shorthand for scale — a 7-billion-parameter model is smaller, cheaper, and faster than a 70-billion one, usually at some accuracy cost.

