The AlphaGo Milestone - Deepstash

The AlphaGo Milestone

In March 2016, reinforcement learning had a landmark moment.

A DeepMind system called AlphaGo became the first computer program to defeat a world champion in Go, a famously complex board game.

The victory was reportedly watched by over 200 million people.

AlphaGo was first trained on records of human expert games, then improved by playing against different versions of itself thousands of times, learning incrementally through trial and error, a process known as reinforcement learning. Its successor, AlphaGo Zero, went further and learned the game from scratch through self-play alone, leaving it free to learn the game for itself, unconstrained by orthodox thinking.

MORE IDEAS ON THIS

How a Reward Function Works

In AI systems, the rewards and punishments are calculated mathematically. A self-driving system could receive a -1 when the model hits a wall, and a +1 if it safely passes another car. These signals allow the agent to evaluate its performance.

The algorithm then learns through trial and err...
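
The reward signal described in this idea can be sketched as a small function. This is only an illustration: the event names (`hit_wall`, `passed_car_safely`, `cruising`) and the neutral reward of 0 for unlisted events are assumptions; only the -1 and +1 values come from the text.

```python
from typing import Iterable

# Hypothetical event names; only the -1 / +1 values come from the text.
REWARDS = {
    "hit_wall": -1.0,
    "passed_car_safely": 1.0,
}


def reward(event: str) -> float:
    """Return the scalar reward for one event; unlisted events are neutral (assumed 0)."""
    return REWARDS.get(event, 0.0)


def episode_return(events: Iterable[str]) -> float:
    """Sum the rewards over one episode so the agent can evaluate its performance."""
    return sum(reward(e) for e in events)


if __name__ == "__main__":
    episode = ["passed_car_safely", "cruising", "passed_car_safely", "hit_wall"]
    print(episode_return(episode))  # 1.0
```

A real driving system would likely use a richer signal (speed, distance to obstacles, comfort), but the principle is the same: every outcome is mapped to a number the agent can add up and compare.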

The Bottom Line

There are still major challenges to overcome. RL agents struggle both to maximize rewards in complex environments and to assess the long-term repercussions of their actions. Nonetheless, proponents of the "reward is enough" hypothesis believe the algorithms’ adaptability could pave a path to artificial general intelligence (AGI).

Reinforcement Learning

In reinforcement learning (RL), a software agent learns through trial and error. When it takes a desired action, the agent receives a reward.

Over time, the agent works out how to execute the task to optimize its reward.

The technique can be applied to a vast array ...
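
The trial-and-error loop can be made concrete with a toy example. The sketch below uses tabular Q-learning, one common RL algorithm chosen here for illustration (the text does not name a specific method). The five-cell corridor task, the function names, and the hyperparameters are all assumptions.

```python
import random

N_STATES = 5        # corridor cells 0..4; cell 4 is the goal (assumed toy task)
ACTIONS = (-1, +1)  # step left or step right


def step(state, action):
    """Move within the corridor; the reward is +1 only on reaching the goal cell."""
    next_state = min(max(state + action, 0), N_STATES - 1)
    done = next_state == N_STATES - 1
    return next_state, (1.0 if done else 0.0), done


def train(episodes=500, alpha=0.5, gamma=0.9, epsilon=0.1, seed=0):
    """Tabular Q-learning: try actions, observe rewards, update value estimates."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]  # q[state][action_index]
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            # Trial and error: usually exploit the best-known action,
            # but explore at random sometimes (and whenever values are tied).
            if rng.random() < epsilon or q[state][0] == q[state][1]:
                a = rng.randrange(2)
            else:
                a = 0 if q[state][0] > q[state][1] else 1
            next_state, r, done = step(state, ACTIONS[a])
            # Move the estimate toward reward plus discounted future value.
            target = r if done else r + gamma * max(q[next_state])
            q[state][a] += alpha * (target - q[state][a])
            state = next_state
    return q


def greedy_policy(q):
    """Best-known action (-1 or +1) for each non-goal cell."""
    return [ACTIONS[0 if row[0] > row[1] else 1] for row in q[:-1]]


if __name__ == "__main__":
    print(greedy_policy(train()))  # [1, 1, 1, 1]
```

The agent is never told that moving right is correct. It discovers this because only sequences of actions that end at the goal produce a reward, and the update rule propagates that reward back to earlier cells.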

The limits of AlphaGo or GPT-3

Although AI researchers can train systems to win at Space Invaders, those systems couldn’t play games like Montezuma's Revenge, where rewards could only be collected after completing a series of actions (for example, climb down ladder, get down rope, get down another ladder, jump over skull and climb up a ...
