How a Reward Function Works - Deepstash
How a Reward Function Works

How a Reward Function Works

In AI systems, the rewards and punishments are calculated mathematically. A self-driving system could receive a -1 when the model hits a wall, and a +1 if it safely passes another car. These signals allow the agent to evaluate its performance.

The algorithm then learns through trial and error to maximize the reward — and ultimately, complete the task in the most desirable manner.

10

24 reads

CURATED FROM

IDEAS CURATED BY

theodorexh

There is a difference between patience & procrastination.

The idea is part of this collection:

Machine Learning With Google

Learn more about artificialintelligence with this collection

Understanding machine learning models

Improving data analysis and decision-making

How Google uses logic in machine learning

Related collections

Read & Learn

20x Faster

without
deepstash

with
deepstash

with

deepstash

Personalized microlearning

100+ Learning Journeys

Access to 200,000+ ideas

Access to the mobile app

Unlimited idea saving

Unlimited history

Unlimited listening to ideas

Downloading & offline access

Supercharge your mind with one idea per day

Enter your email and spend 1 minute every day to learn something new.

Email

I agree to receive email updates