The limits of AlphaGo or GPT-3

Although AI researchers can train systems to win at Space Invaders, it couldn’t play games like Montezuma Revenge where rewards could only be collected after completing a series of actions (for example, climb down ladder, get down rope, get down another ladder, jump over skull and climb up a third ladder).

For these types of games, the algorithms can’t learn because they require an understanding of the concept of ladders, ropes and keys. Something us humans have built in to our cognitive model of the world & that can’t be learnt by the reinforcement learning approach of DeepMind.

301 reads

CURATED FROM

Is Google’s AI research about to implode?

soccermatics.medium.com

3 ideas

721 reads

IDEAS CURATED BY

Vladimir Oane

@vladimir

Life-long learner. Passionate about leadership, entrepreneurship, philosophy, Buddhism & SF. Founder @deepstash.

The idea is part of this collection:

Learn more about philosophy with this collection

Centers of Progress

The historical significance of urban centers

The impact of cultural and technological advances

The role of urban centers in shaping society

Related collections

Lessons of History

The Mind of Leonardo da Vinci

Prelude to Paladin

The Easter Celebration

Read & Learn

20x Faster

without
deepstash

with
deepstash

with

deepstash

Personalized microlearning

—

100+ Learning Journeys

—

Access to 200,000+ ideas

—

Access to the mobile app

—

Unlimited idea saving

—

Unlimited history

—

Unlimited listening to ideas

—

Downloading & offline access

—

Supercharge your mind with one idea per day

Enter your email and spend 1 minute every day to learn something new.

I agree to receive email updates

deepstash

Content

Ideas

Collections

Stories

Explore

Product

Pricing

Businesses

Resources

Terms

Privacy

Press Kit

Sitemap

Company

About

Contact