Learn more about computerscience with this collection
Understanding machine learning models
Improving data analysis and decision-making
How Google uses logic in machine learning
The above sorting using the key
parameter can be confusing to some people. Is there a cleaner way? Pandas is arguably the most versatile library for data processing, and you can expect that there is something neat to solve this relatively common problem — converting these lexicographically unordered columns to categorical data.
CategoricalDtype
by specifying the order of the months. 9
13 reads
MORE IDEAS ON THIS
As we’ve seen so far, every sorting is done using the ascending order, which is the default behavior. However, we often want to have the data sorted by a descending order. We can take advantage of the ascending
parameter.
What should we do if we sort by multiple columns and ha...
8
26 reads
In this article, we’ll be using the flights dataset, which records the monthly passenger...
8
34 reads
Apparently, the sorted data isn’t something that we expect — the months are not in the desired order. To make this happen, we can take advantage of the sort_method
taking a key
parameter, to which we can pass a custom function for sorting, just like Python’s built-in
8
17 reads
It’s important to remember that your datasets can always contain NANs. Unless you’ve examined your data quality and know that there are no NANs, you should pay attention to that. When we sort values, these NANs are placed behind all the other valid values, by default. If we want to change this de...
8
17 reads
We don’t always need one column for sorting. In many cases, we need to sort the data frame by multiple columns. It’s also simple with sort_values because by
doesn’t only take a single column but also a list of columns without any special syntax.
8
12 reads
In the previous sorting, you may notice that the index goes with each sorted row, which puzzles me sometimes, when I want the sorted DataFrame has an ordered index. In this case, you can either reset the index after sorting, or simply take advantage of the ignore_index
parameter, as ...
8
19 reads
In the previous sorting, one thing you may have notices is that the sort_values
method will create a new DataFrame
object, as shown below.
To avoid creating a new DataFrame, you can request the sorting to be done inplace by setting the inplace
paramet...
8
18 reads
Related collections
More like this
Pandas is a Python language package, which is used for data processing. This is a very common basic programming library when we use Python language for machine learning programming. This article is an introductory tutorial to it. Pandas provide fast, flexible and expressive data structures with t...
Read & Learn
20x Faster
without
deepstash
with
deepstash
with
deepstash
Access to 200,000+ ideas
—
Access to the mobile app
—
Unlimited idea saving & library
—
—
Unlimited history
—
—
Unlimited listening to ideas
—
—
Downloading & offline access
—
—
Personalized recommendations
—
—
Supercharge your mind with one idea per day
Enter your email and spend 1 minute every day to learn something new.
I agree to receive email updates