The limits of AlphaGo or GPT-3 - Deepstash

The limits of AlphaGo or GPT-3

Although AI researchers can train systems to win at Space Invaders, it couldn’t play games like Montezuma Revenge where rewards could only be collected after completing a series of actions (for example, climb down ladder, get down rope, get down another ladder, jump over skull and climb up a third ladder).

For these types of games, the algorithms can’t learn because they require an understanding of the concept of ladders, ropes and keys. Something us humans have built in to our cognitive model of the world & that can’t be learnt by the reinforcement learning approach of DeepMind.

18

295 reads

CURATED FROM

IDEAS CURATED BY

vladimir

Life-long learner. Passionate about leadership, entrepreneurship, philosophy, Buddhism & SF. Founder @deepstash.

The idea is part of this collection:

Centers of Progress

Learn more about philosophy with this collection

The historical significance of urban centers

The impact of cultural and technological advances

The role of urban centers in shaping society

Related collections

Read & Learn

20x Faster

without
deepstash

with
deepstash

with

deepstash

Personalized microlearning

100+ Learning Journeys

Access to 200,000+ ideas

Access to the mobile app

Unlimited idea saving

Unlimited history

Unlimited listening to ideas

Downloading & offline access

Supercharge your mind with one idea per day

Enter your email and spend 1 minute every day to learn something new.

Email

I agree to receive email updates