James Owusu-Appiah
5 min read
QS World Ranked Universities Analysis
QS WORLD RANKED UNIVERSITIES ANALYSIS This is in fulfillment of my final project as part of Data Insight Data Scientist Program. My final...
James Owusu-Appiah
4 min read
Linear Classifiers And Machine Learning With Tree Based Models In Python
LINEAR CLASSIFIERS Linear classifiers are supervised machine learning algorithms that classify data into labels based on linear...
James Owusu-Appiah
7 min read
Supervised and Unsupervised Learning
What Is Supervised Learning? A function that maps an input to an output is learned through supervised learning using sample input-output...
James Owusu-Appiah
7 min read
Understanding Fundamental Statistical Concepts In Data Science
What Is Statistics? Statistics is a form of mathematical analysis that uses quantified models and representations for a given set of...
James Owusu-Appiah
8 min read
TIME SERIES ANALYSIS OF NAICS
NAICS The North American Industry Classification System (NAICS) is the standard used by Federal statistical agencies in classifying...
asma kirli
7 min read
My roadmap into preprocessing data: Feature extraction from raw text using TF-IDF
“ Without big data, you are blind and deaf and in the middle of a freeway” - Geoffrey Moore. When we say big data, we mean a huge amount...
James Owusu-Appiah
4 min read
IMPORTING AND CLEANING DATA IN PYTHON
Importing Data In Python We need datasets for data analysis. Python has various modules which help us in importing the external data in...
Tanushree Nepal
5 min read
Exploratory Data Analysis: Importing, Cleaning, and Visualization of Titanic Dataset
Exploratory Data Analysis (EDA) is used by data scientists to analyze and investigate data sets and summarize their main characteristics,...
Umme Rubaiyat Chowdhury
5 min read
Importing cleaning and Visualizing data using python
In this instructional exercise, we’ll utilize Python’s Pandas and NumPy libraries to clean data. In the primary portion I will clarify...
Blessing Oluwadaara
15 min read
Pandas Techniques in Python for Data Manipulation
Introduction Python is the language of choice for data scientists. It provides a programming language's larger ecosystem as well as the...