Pandas Technique: Subsetting
import pandas as pd
import numpy as np
Read Dataset
df = pd.read_csv('Srt_dta.csv')
df
Subsetting columns
To select a single column, put its name in square brackets []. The result is a pandas Series.
df['Name']
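As a minimal sketch of the difference between single and double brackets, the example below builds a small stand-in DataFrame. The contents of Srt_dta.csv are not shown here, so the name sample and all values are hypothetical.

```python
import pandas as pd

# Hypothetical stand-in for Srt_dta.csv; the values are made up for illustration.
sample = pd.DataFrame({
    "Name": ["Bella", "Charlie", "Lucy"],
    "Breed": ["Labrador", "Poodle", "Chihuahua"],
    "Height(cm)": [56, 43, 18],
})

name_series = sample["Name"]    # single brackets return a Series
name_frame = sample[["Name"]]   # a list of names returns a DataFrame

print(type(name_series).__name__)  # Series
print(type(name_frame).__name__)   # DataFrame
```

The one-column DataFrame is handy when a later step expects a table rather than a single column.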
Subsetting multiple columns
# method 1
df[["Breed","Height(cm)"]]
# method 2
cols_to_subset = ["Breed","Height(cm)"]
df[cols_to_subset]
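Both methods pass a list of column names, and the order of that list sets the order of the columns in the result. The sketch below uses a hypothetical sample DataFrame (not the real dataset) and also shows the label-based .loc form, which should give the same result.

```python
import pandas as pd

# Hypothetical data; only the column names mirror the ones used above.
sample = pd.DataFrame({
    "Name": ["Bella", "Charlie"],
    "Breed": ["Labrador", "Poodle"],
    "Height(cm)": [56, 43],
})

cols_to_subset = ["Height(cm)", "Breed"]      # list order sets the output column order
subset = sample[cols_to_subset]
same_subset = sample.loc[:, cols_to_subset]   # label-based equivalent

print(list(subset.columns))        # ['Height(cm)', 'Breed']
print(subset.equals(same_subset))  # True
```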
Subsetting rows
A comparison on a column returns a boolean Series: True for each row that meets the condition and False otherwise.
df["Height(cm)"] > 50
# Passing the boolean Series inside [] returns only the rows where the condition is True
df[df["Height(cm)"] > 50]
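The two steps above can be sketched end to end on a small hypothetical DataFrame (the names sample, is_tall, and tall_dogs and the values are assumptions for illustration). Note that the filtered rows keep their original index labels.

```python
import pandas as pd

# Hypothetical data for illustration only.
sample = pd.DataFrame({
    "Name": ["Bella", "Charlie", "Lucy", "Cooper"],
    "Height(cm)": [56, 43, 18, 59],
})

is_tall = sample["Height(cm)"] > 50   # boolean Series, one value per row
tall_dogs = sample[is_tall]           # keeps only rows where the mask is True

print(is_tall.tolist())            # [True, False, False, True]
print(tall_dogs["Name"].tolist())  # ['Bella', 'Cooper']
print(tall_dogs.index.tolist())    # [0, 3]
```

If a fresh 0-based index is wanted after filtering, tall_dogs.reset_index(drop=True) is one option.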
Subsetting based on text data
df[df["Breed"] == 'Labrador']
Subsetting based on multiple conditions
is_lab = df['Breed'] == 'Labrador'
is_black = df['Color'] == 'Black'
df[is_lab & is_black]
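Conditions can also be combined inline instead of through named variables. In that case each comparison needs its own parentheses, because & and | bind more tightly than == in Python. The sketch below uses hypothetical data and also shows | (or) and ~ (not).

```python
import pandas as pd

# Hypothetical data for illustration only.
sample = pd.DataFrame({
    "Name": ["Bella", "Charlie", "Lucy", "Max"],
    "Breed": ["Labrador", "Poodle", "Labrador", "Chow Chow"],
    "Color": ["Brown", "Black", "Black", "Brown"],
})

# AND: both conditions must hold
black_labs = sample[(sample["Breed"] == "Labrador") & (sample["Color"] == "Black")]
# OR: at least one condition must hold
lab_or_black = sample[(sample["Breed"] == "Labrador") | (sample["Color"] == "Black")]
# NOT: invert a condition
not_labs = sample[~(sample["Breed"] == "Labrador")]

print(black_labs["Name"].tolist())    # ['Lucy']
print(lab_or_black["Name"].tolist())  # ['Bella', 'Charlie', 'Lucy']
print(not_labs["Name"].tolist())      # ['Charlie', 'Max']
```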
Subsetting using .isin()
The pandas isin() method filters data by checking whether each value in a column appears in a given collection, which makes it convenient for selecting rows that match one or more values. Parameter: values, an iterable (such as a list or tuple), Series, DataFrame, or dictionary of values to look for in the calling Series or DataFrame.
is_black_or_brown = df['Color'].isin(['Black', 'Brown'])
df[is_black_or_brown]
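As a rough illustration, isin() gives the same mask as chaining == comparisons with |, and ~ inverts it to select everything outside the list. The data below is hypothetical.

```python
import pandas as pd

# Hypothetical data for illustration only.
sample = pd.DataFrame({
    "Name": ["Bella", "Charlie", "Lucy", "Max"],
    "Color": ["Brown", "Black", "Black", "White"],
})

is_black_or_brown = sample["Color"].isin(["Black", "Brown"])
# Equivalent mask written with explicit comparisons
same_mask = (sample["Color"] == "Black") | (sample["Color"] == "Brown")
# Rows whose color is NOT in the list
other_colors = sample[~is_black_or_brown]

print(is_black_or_brown.equals(same_mask))  # True
print(other_colors["Name"].tolist())        # ['Max']
```

With more than two or three values, the isin() form tends to stay readable where a chain of | comparisons does not.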