Over And Under Sampling

Sometimes if we want to compare two datasets, or classify datasets that have an uneven number of samples for different sides or types. Just by taking fewer samples (undersampling), one can even out a dataset.

Oversampling is a way to copy datasets to have the same number of examples as the other class. The copies are produced maintaining the distribution ratio.

119

398 reads

CURATED FROM

The 5 Basic Statistics Concepts Data Scientists Need to Know

towardsdatascience.com

7 ideas

4.62K reads

IDEAS CURATED BY

Cameron

@camz

Everyone you meet has something to teach you.

The idea is part of this collection:

Learn more about problemsolving with this collection

Behavioral Economics, Explained

How to make rational decisions

The role of biases in decision-making

The impact of social norms on decision-making

Related collections

The Art of Decision-Making

Productivity Systems

How To Study Effectively For Exams

7 Books on Habits

Read & Learn

20x Faster

without
deepstash

with
deepstash

with

deepstash

Personalized microlearning

—

100+ Learning Journeys

—

Access to 200,000+ ideas

—

Access to the mobile app

—

Unlimited idea saving

—

Unlimited history

—

Unlimited listening to ideas

—

Downloading & offline access

—

Supercharge your mind with one idea per day

Enter your email and spend 1 minute every day to learn something new.

I agree to receive email updates

deepstash

Content

Ideas

Collections

Stories

Explore

Product

Pricing

Businesses

Resources

Terms

Privacy

Press Kit

Sitemap

Company

About

Contact