The amount of computational power used by an AI model when its generating a response or performing a task after its been trained.