A token is roughly a few characters of text or a fixed piece of an image. Models process input and produce output token by token, and providers bill per thousand tokens. Estimating tokens per request is how you forecast the monthly cost of an AI feature.