0% found this document useful (0 votes)
32 views

Data Transfer Between Files, SQL Databases & Dataframes: Comma To Separate Each Specific Data Value. CSV Advantages

CSV (comma-separated values) files store tabular data in plain text format. Each row of data is on a new line with fields separated by commas. CSV files are widely used for data interchange but have limitations like a lack of data types and no standard for special characters. Python's pandas library can be used to read CSV files into DataFrames for analysis and to write DataFrames out to CSV format.

Uploaded by

Kriti Saxena
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
32 views

Data Transfer Between Files, SQL Databases & Dataframes: Comma To Separate Each Specific Data Value. CSV Advantages

CSV (comma-separated values) files store tabular data in plain text format. Each row of data is on a new line with fields separated by commas. CSV files are widely used for data interchange but have limitations like a lack of data types and no standard for special characters. Python's pandas library can be used to read CSV files into DataFrames for analysis and to write DataFrames out to CSV format.

Uploaded by

Kriti Saxena
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 6

Data Transfer between

Files, SQL Databases & DataFrames


CSV file (Comma Separated Values file) is a type of plain text file that uses specific
structuring to arrange tabular data. Because it’s a plain text file, it can contain only
actual text data—in other words, printable ASCII or Unicode characters.

The structure of a CSV file is given away by its name. Normally, CSV files use a
comma to separate each specific data value.

CSV Advantages

 CSV is human readable and easy to edit manually


 CSV is simple to implement and parse
 CSV is processed by almost all existing applications. (Nearly all
spreadsheets and database support import/export to CSV format)
 CSV format is common for data interchange.
 CSV provides a straightforward information schema
 CSV is faster to handle
 CSV is smaller in size
 CSV is considered to be standard format
 CSV is compact. For XML you start tag and end tag for each column in each
row. In CSV you write the column headers only once.
 CSV is easy to generate
CSV Disadvantages

 CSV allows moving most basic data only. Complex configurations cannot
be imported and exported this way
 There is no distinction between text and numeric values
 No standard way to represent binary data
 Problems with importing CSV into SQL (no distinction between NULL and
quotes)
 Poor support of special characters
 No standard way to represent control characters
 Lack of universal standard
The CSV file format is not fully standardized. The basic idea of separating fields with
a comma is clear, but that idea gets complicated when the field data may also contain
commas or even embedded line-breaks. CSV implementations may not handle such
field data, or they may use quotation marks to surround the field. Quotation does not
solve everything: some fields may need embedded quotation marks, so a CSV
implementation may include escape characters or escape sequences.

www.pythonclassroomdiary.wordpress.com by Sangeeta M Chauhan pg. 1


1. HANDLING (READING / WRITING ) DATA OF CSV FILE:
1.1 USING NOTEPAD OR ANY TEXT EDITOR To create a CSV file
with a text editor, first choose your favorite text editor, and open a
new file. Then enter the text data you want the file to contain,
separating each value with a comma and each row with a new line.

save this file


with
the extension
.csv

You can then open the file using Microsoft Excel or another spreadsheet
program. It would create a table of data similar to the following:

Note : If the fields of data in your CSV file contain commas, you can protect
them by enclosing those data fields in double quotes (").
1.2 Using Data Frame
(i) pandas.DataFrame.to_csv : (To create CSV from DataFrame)
DataFrame.to_csv(path_or_buf=None, sep=', ', na_rep='', float_format=No
ne, columns=None, header=True,index=True, index_label=None, mode='w'
, encoding=None, compression='infer', quoting=None, quotechar='"',line_te
rminator=None, chunksize=None, tupleize_cols=None, date_format=None,
doublequote=True,escapechar=None, decimal='.')

www.pythonclassroomdiary.wordpress.com by Sangeeta M Chauhan pg. 2


Note : While creating CSV File with function to_csv it by default added index
with records

EXAMPLE 1: (With Index)


import pandas as pd

data = {'Name':['Shraddha', 'Shanti','Monica','Yogita'],'Age':[28,34,29,39]}

df = pd.DataFrame(data)

df.to_csv("mycsv.csv")

Note that index


0,1,2,3 have
been added with
records by
default

EXAMPLE 2 : (Without Index)

import pandas as
pd
Here we have made
data = {'Name':['Shraddha', index false
'Shanti','Monica','Yogita'],'Age':[28,34,29,39]}

df = pd.DataFrame(data)

df.to_csv("mycsv1.csv",index=False)

Note here
index is not
appearing
with records

EXAMPLE 3 : (Writing DataFrame containing NaN Values)

www.pythonclassroomdiary.wordpress.com by Sangeeta M Chauhan pg. 3


While
writing
dataFrame
NaN is
replaced by
“Not
Known”

EXAMPLE 4 : (Changing Separator Character while creating CSV file)

Separator
is replaced
with ;

ii. pandas.read_csv (<filepath>) : Importing Data with


pandas.read_csv(filepath, sep=', ', delimiter=None, header='infer', names=None, i
ndex_col=None,usecols=None,skiprows=None,,nrows=None, na_values=None, )

www.pythonclassroomdiary.wordpress.com by Sangeeta M Chauhan pg. 4


Shraddha 28 Gwalior Reading from csv File “mycsv.csv” and store it in
0 Shanti 34 Lucknow Dataframe “df1”
1 Monica 29 Indore
2 Yogita 39 Gwalior

...........................................
0 1 2
0 Shraddha 28 Gwalior Showing Elements without header
1 Shanti 34 Lucknow
2 Monica 29 Indore
3 Yogita 39 Gwalior

...........................................
Name Age City
0 Shraddha 28 Gwalior Column headings are shown with dataFrame
1 Shanti 34 Lucknow
2 Monica 29 Indore
3 Yogita 39 Gwalior

To Give/change Column Headings


Member Name Age City
0 Name Age City
1 Shraddha 28 Gwalior
New column headings are given but old column
2 Shanti 34 Lucknow
3 Monica 29 Indore headings are shown as first row
4 Yogita 39 Gwalior

To skip particular row


MemberName Age City
0 Shraddha 28 Gwalior old column headings are skipped
1 Shanti 34 Lucknow
2 Monica 29 Indore
3 Yogita 39 Gwalior

To read Specific no. of rows from csv fiile

www.pythonclassroomdiary.wordpress.com by Sangeeta M Chauhan pg. 5


Shraddha 28 Gwalior
0 Shanti 34 Lucknow Reading only two rows from file
1 Monica 29 Indore

www.pythonclassroomdiary.wordpress.com by Sangeeta M Chauhan pg. 6

You might also like