Attention

Definition

The mechanism that lets a model weigh which parts of the input matter most for each output. The core idea that makes transformers work.

Attention scores how much each input element should influence each output element, so the model focuses on the relevant words or image patches. It is what gives transformers their grasp of context, and its compute cost shapes how long inputs can be.

Also known as

self-attention, attention mechanism

Specialist software house for video, real-time and AI products. Founded 2005. 50 in-house engineers.

Knowledge base

Blog Guides Courses Glossary Downloads

Company

Services Projects Demos Calculator Contacts

+852-8193-2621

Hong Kong

+1 (914) 775-5855

New York · USA

eager2develop@forasoft.com

Your message has been sent successfully

We will contact you soon

Message not sent. Please try again.

Attention

Related terms

Transformer

Context window

KV cache