Curated from: towardsdatascience.com
Statistics uses mathematics to analyze data rigorously. Instead of guesstimating, we let the data give us concrete, factual information.
The most widely used statistical concept in data science is statistical features. These include important measurements such as bias, variance, mean, median and percentiles, all of which are easy to compute in code.
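As a rough illustration of how code-friendly these measurements are, the sketch below computes a few of them with Python's standard `statistics` module. The sample values in `data` are made up for the example and do not come from the original article.

```python
import statistics

# Hypothetical sample; any list of numbers works here.
data = [2, 4, 4, 5, 7, 9, 11]

mean = statistics.mean(data)            # arithmetic average
median = statistics.median(data)        # middle value of the sorted data
variance = statistics.pvariance(data)   # population variance (spread around the mean)

# Quartiles are the 25th, 50th and 75th percentiles.
quartiles = statistics.quantiles(data, n=4)

print("mean:", mean)
print("median:", median)
print("variance:", round(variance, 3))
print("quartiles:", quartiles)
```

The second quartile is the median, so the two values should agree.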
A box plot, a standard diagram for summarizing a data set, carries a lot of information: the median, the quartiles, the overall spread, and any outliers.
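The numbers behind a box plot can be computed directly. The sketch below is one possible version: it assumes the common 1.5 × IQR convention for flagging outliers, which the original text does not specify, and the function name `box_plot_summary` and the sample values are hypothetical.

```python
import statistics

def box_plot_summary(values):
    """Return the quantities a box plot usually draws."""
    ordered = sorted(values)
    q1, q2, q3 = statistics.quantiles(ordered, n=4, method="inclusive")
    iqr = q3 - q1  # interquartile range: the height of the box
    # Assumed convention: points beyond 1.5 * IQR from the box are outliers.
    low_fence = q1 - 1.5 * iqr
    high_fence = q3 + 1.5 * iqr
    outliers = [v for v in ordered if v < low_fence or v > high_fence]
    return {"q1": q1, "median": q2, "q3": q3, "iqr": iqr, "outliers": outliers}

# Hypothetical data with one value far from the rest.
summary = box_plot_summary([1, 2, 3, 4, 5, 6, 7, 8, 9, 50])
print(summary)
```

A short box means the middle half of the data is tightly clustered, and a median line sitting near one edge of the box suggests the data is skewed.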
In data science, probability is the chance that something will happen, expressed as a number between 0 and 1. A value of 0 means the event will not occur, while a value of 1 means we are certain it will happen.
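A minimal sketch of the 0-to-1 scale, assuming every outcome in the sample space is equally likely (a fair six-sided die). The helper `probability` is a hypothetical name introduced for the example.

```python
from fractions import Fraction

def probability(event, sample_space):
    """Share of equally likely outcomes for which `event` is true."""
    outcomes = list(sample_space)
    favourable = sum(1 for outcome in outcomes if event(outcome))
    return Fraction(favourable, len(outcomes))

die = range(1, 7)  # faces of a fair six-sided die

p_even = probability(lambda face: face % 2 == 0, die)          # possible event
p_seven = probability(lambda face: face == 7, die)              # impossible event
p_any_face = probability(lambda face: 1 <= face <= 6, die)      # certain event

print(p_even, p_seven, p_any_face)
```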
The common probability distributions are:
Dimensionality reduction is the process of reducing the number of dimensions (feature variables) in a dataset.
If a cube contains 1,000 data points, we can reduce its dimensionality by projecting the 3D data onto a 2D plane. We can also remove feature variables to reduce the data volume. This is called feature pruning, and it is generally applied to features that have a low correlation with the output we want to predict.
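Both ideas can be sketched in a few lines. In this example, "low correlation" is taken to mean a low absolute Pearson correlation with a target column, which is an assumption; the threshold of 0.3, the function names, and the sample data are all hypothetical.

```python
import math

def pearson(xs, ys):
    """Pearson correlation of two equal-length numeric lists.
    Assumes neither list is constant (otherwise the denominator is zero)."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs))
    spread_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (spread_x * spread_y)

def prune_features(features, target, threshold=0.3):
    """Keep only the features whose |correlation| with the target meets the threshold."""
    return {name: column for name, column in features.items()
            if abs(pearson(column, target)) >= threshold}

# Projection: view 3D points as 2D by dropping the z coordinate.
points_3d = [(1, 2, 3), (4, 5, 6), (7, 8, 9)]
points_2d = [(x, y) for x, y, _z in points_3d]

# Feature pruning on made-up data.
features = {
    "size": [1, 2, 3, 4, 5, 6],
    "noise": [3, 1, 2, 2, 1, 3],
}
target = [2, 4, 6, 8, 10, 12]
kept = prune_features(features, target)
print(points_2d)
print(sorted(kept))
```

Dropping a coordinate is the simplest possible projection; it keeps the code short but discards whatever information the removed axis carried.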
Sometimes we want to compare or classify data in which the classes have an uneven number of samples. Undersampling evens out such a dataset by taking fewer samples from the larger class.
Oversampling instead creates copies of examples from the smaller class until it has the same number of examples as the larger class. The copies are made so that the distribution of the smaller class is preserved.
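A minimal sketch of both techniques using random selection, which is one simple way to resample and is not necessarily the method the original article has in mind. The function names, the `seed` parameter, and the sample classes are hypothetical.

```python
import random

def undersample(majority, minority, seed=0):
    """Randomly keep only as many majority examples as there are minority examples."""
    rng = random.Random(seed)
    return rng.sample(majority, len(minority)), list(minority)

def oversample(majority, minority, seed=0):
    """Randomly duplicate minority examples until both classes are the same size."""
    rng = random.Random(seed)
    extra = rng.choices(minority, k=len(majority) - len(minority))
    return list(majority), list(minority) + extra

# Made-up imbalanced classes: 10 examples versus 3.
majority = list(range(10))
minority = [100, 101, 102]

under_major, under_minor = undersample(majority, minority)
over_major, over_minor = oversample(majority, minority)

print(len(under_major), len(under_minor))
print(len(over_major), len(over_minor))
```

Undersampling throws data away, while oversampling repeats existing examples, so neither adds new information; they only rebalance what is already there.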
Bayesian statistics builds on the concept of probability: it combines a prior, computed from past data, with new evidence to forecast a future trend. This matters because prior data alone will not reflect a specific change in the present.
Frequency analysis, by contrast, computes the likelihood of a specific occurrence from past data only, and new information is not taken into account.
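The contrast can be sketched with Bayes' theorem. All numbers below are invented for illustration, and `bayes_update` is a hypothetical helper name.

```python
def bayes_update(prior, likelihood, likelihood_if_false):
    """Posterior P(H | E) from Bayes' theorem.

    prior               -- P(H), belief before seeing the evidence
    likelihood          -- P(E | H)
    likelihood_if_false -- P(E | not H)
    """
    evidence = likelihood * prior + likelihood_if_false * (1 - prior)
    return likelihood * prior / evidence

# Frequency view: the event happened on 10 of 100 past days.
frequency_estimate = 10 / 100

# Bayesian view: start from that frequency as the prior, then fold in new
# evidence that is four times more likely when the event is going to happen.
posterior = bayes_update(prior=frequency_estimate,
                         likelihood=0.8,
                         likelihood_if_false=0.2)

print("frequency estimate:", frequency_estimate)
print("posterior after evidence:", round(posterior, 4))
```

The frequency estimate stays the same whatever is observed today, while the posterior moves as soon as the evidence favors one outcome.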