
Important Pandas Techniques in Python
Apply a function to apply a function along an axis of the DataFrame we use: pandas.DataFrame.apply DataFrame.apply(func, axis=0,...
Mariam Ahmed
3 min read


Cleaning Data in Python
Cleaning The Dataset Data cleaning is important because, if data is wrong or contains missing values, outcomes and algorithms will be...
Mariam Ahmed
3 min read

TIME SERIES ANALYSIS OF NAICS
NAICS The North American Industry Classification System (NAICS) is the standard used by Federal statistical agencies in classifying...
James Owusu-Appiah
8 min read

Character Locator in Python
Given a string and a character, your task is to locate the positions of that character in the string. These types of problems are very...
Mariam Ahmed
1 min read


Simple Guide to write a BMI Calculator app in Python
What is BMI? The Body Mass Index (BMI) is a measurement of a person's leanness or corpulence based on their height and weight and is...
Mariam Ahmed
2 min read


Lists in Python
The list is a data structure in Python that is a mutable, ordered sequence of elements. It is one of 4 built-in data types in Python. It...
Mariam Ahmed
3 min read

Pandas Data Frame
A Data frame is a two-dimensional data structure. Data Frame is a module in Pandas Library, so to use it for the first time we should...
Mariam Ahmed
2 min read
Pandas Techniques for Data Manipulation in Python
Pandas is a great library for data Manipulation, it offers many tools to analyse data. Pandas is made up by numpy and matplotlib, so it...
Wilson Waha
2 min read

Analysis Of NAICS Timeseries
The North American Industry Classification System (NAICS) is an industry classification system developed by the statistical agencies of...
aya abdalsalam
3 min read

Analysis of NAICS
I consider myself a Data Analyst. The head of my department at my new company has given me the following instructions: 1. Download zipped...
Ahmed Shebl
5 min read

Extreme Gradient Boosting with XGBoost and Cluster Analysis in Python
Gradient boosting is currently one of the most popular techniques for the efficient modeling of tabular datasets of all sizes. XGboost is...
Tanushree Nepal
3 min read
Rock Paper and Scissor
From an early age, we have been playing Rock Paper, and scissors with our friends mostly either in a duel or to make a group in a...
Sabin Panta
1 min read


Machine Learning for Time Series Data and Feature Engineering for Machine Learning in Python
Machine Learning for Time Series Data in Python Real-world data sets are essential for developing and testing machine learning models....
Arpan Sapkota
8 min read
Python for Email Username Scrapping
Many times on our journey to learn to program, the tutorial might get hectic and you wanna do something for fun. For fun, I mean to use...
Sabin Panta
2 min read


A gentle introduction to some concepts of Time Series Analysis
Prediction sometimes involves time series. Time series analysis can be both done via supervised and unsupervised machine learning. In...
Ntandoyenkosi Matshisela
3 min read

North American Industry Classification System NACIS Time Series Analysis
The North American Industry Classification System or NAICS is a classification of business establishments by type of economic activity...
Amr Mohamed Salama
3 min read

North American Industry Classification System NACIS Time Series Analysis
The North American Industry Classification System or NAICS is a classification of business establishments by type of economic activity...
Amr Salama
4 min read
Where Function in Pandas
The where() in pandas and NumPy is used to filter Data Frames based on a condition. We will discuss this important topic in this tutorial....
Ebrima Sise
2 min read


Let's deal with some text data
Back in recent posts, I mentioned one indisputable fact about the current machine learning models. They deal with numbers. In the case of...
Hamza kchok
6 min read
Detect and Impute Missing Values in Pandas
Hardly you will come across a dataset that is free from missing values. Missing values can be encoded in a number of ways. Sometimes when...
Ebrima Sise
4 min read

Data Preparation in Python
Data Preparation involves the manipulation and consolidation of raw data from different sources into a standardized format so that it can...
Gaurab Awal
2 min read
Adding Columns to a Data Frame
In this tutorial, we will practice adding column(s) to a Data Frame. But first, we need to import pandas and NumPy to be able to...
Ebrima Sise
2 min read
Apply Function in Pandas
The apply function lets you run a function on all the elements of a pandas series/column. It lets you apply a function on either the rows...
Ebrima Sise
2 min read

Co-Variance Matrix in detail
Why is it important? Covariance: There is a strong relationship between Covariance and correlation The concept of variance: s^2 is a...
Sana Omar
3 min read