Directly examine your raw data - Deepstash

Directly examine your raw data

ML models will reflect the data they are trained on, so analyze your raw data carefully to ensure you understand it.

  • Does your data contain any mistakes (e.g., missing values, incorrect labels)?
  • Is your data sampled in a way that represents your users and real-world settings?
  • Are any features in your model redundant or unnecessary? 
  • If you are using a data label X as a proxy to predict a label Y, in which cases is the gap between X and Y problematic?

36

318 reads

CURATED FROM

IDEAS CURATED BY

anikad

Life Is A Marathon| Life Lover

The idea is part of this collection:

Machine Learning With Google

Learn more about philosophy with this collection

Understanding machine learning models

Improving data analysis and decision-making

How Google uses logic in machine learning

Related collections

Read & Learn

20x Faster

without
deepstash

with
deepstash

with

deepstash

Personalized microlearning

100+ Learning Journeys

Access to 200,000+ ideas

Access to the mobile app

Unlimited idea saving

Unlimited history

Unlimited listening to ideas

Downloading & offline access

Supercharge your mind with one idea per day

Enter your email and spend 1 minute every day to learn something new.

Email

I agree to receive email updates