top of page
learn_data_science.jpg

Data Scientist Program

 

Free Online Data Science Training for Complete Beginners.
 


No prior coding knowledge required!

Writer's picturetasnim assali

Importing Data in Python

In this blog, I'm going to show a tutorial on how to import data in python for different types of data files. Python offers modules that help to import different data with various files formats.

First, I going to import data with CVS format as shown in the code below.

The CSV format enables us to read each row in the file using a comma as a delimiter.


In the beginning, I open the file in "read-only" mode, affect the delimiter, and then I used a for loop to read each row from the CSV file.


import csv

with open("C:/Users/asus/Desktop/train.csv,'r') as custfile:
rows=csv.reader(custfile,delimiter=',')
for r in rows:
print(r)

Also, there is another method to import .csv files by using pandas as shown below:


import pandas as pd
import numpy as np
data = pd.read_csv("C:/Users/asus/Desktop/train.csv", index_col="Loan_ID")
data.head(10)

First, I import pandas and numpy, second, load the data with pd.read_csv module, then show the first 10 rows and the result shown below:


The second file format I import with python is .txt format. as shown in the next code. First I import the NumPy library, load the file using loadtxt module.

import numpy as np
filename = 'MNIST_header.txt'
data = np.loadtxt(filename, delimiter=',', skiprows=1, dtype=str)
print(data)

Pandas library can handle excel files using the read_excel module. As shown below the example that imports data from an excel file.

First, I import the pandas library, import the excel file by using the pd.ExcelFile module the use df.parse module with specifying "loan_ID"

and finally, I showed the first 10 lines.

import pandas as pd

df = pd.ExcelFile("C:/Users/asus/Desktop/train.xlsx")
data=df.parse("Loan_ID")
print(data.head(10))

Also in python, we can connect to database servers using a module called pyodbc. This method offers to us the opportunity to import data from relational sources using a SQL query. As shown below:

import pyodbc
sql_conn = pyodbc.connect("Driver={SQL Server};Server=serverName;UID=UserName;PWD=Password;Database=sqldb;")
data_sql = pd.read_sql_query(SQL QUERY’, sql_conn)
data_sql.head()
0 comments

Recent Posts

See All

Comments


bottom of page