Create a Python Dictionary From a CSV File using CSV Module

Question

gishi 0 Newbie Poster

14 Years Ago

hi!

I want to place the contents of a CSV file into a dictionary. Can someone help me with this?

I already have the code for reading the CSV file:

import csv

reader = csv.reader(open(r"c:\sample.dat"))

for row in reader:
    print(row)

I want the first element of each row to be the key of the dictionary, so that when I access the dictionary with that key I get the other elements of that row. I just don't know how to loop through the CSV rows, put them in the dictionary, and use the first element as the key.

For example, with 'key' as the key name, where key == first element of the row:

d = {'key': [1,2,3,4,5,6]}

# accessing the dictionary by key should give the other elements of the row
d['key']    # [1, 2, 3, 4, 5, 6]

can someone help me code this.

Thanks!

python

All 20 Replies

jice 53 Posting Whiz in Training

14 Years Ago

import csv

reader = csv.reader(open(r"c:\sample.dat"))

d={}
for row in reader:
    d[row[0]]=row[1:]
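
For readers on Python 3, the same idea can be sketched as below. This is a minimal sketch rather than jice's exact code: the in-memory sample data and the helper name rows_to_dict are assumptions made for illustration, and io.StringIO stands in for the real file.

```python
import csv
import io

# Hypothetical sample data standing in for the contents of sample.dat.
SAMPLE = "key,1,2,3,4,5,6\nother,7,8,9\n"

def rows_to_dict(fileobj):
    """Map the first field of each row to a list of the remaining fields."""
    result = {}
    for row in csv.reader(fileobj):
        if not row:  # skip blank lines
            continue
        result[row[0]] = row[1:]
    return result

d = rows_to_dict(io.StringIO(SAMPLE))
print(d["key"])  # the values are strings, not integers
```

With a real file, something like open(path, newline='') inside a with block would replace io.StringIO.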

jice 53 Posting Whiz in Training

14 Years Ago

My 2 cents:
Here is how to use csv with another dialect, and dict(zip()) instead of a named tuple.

"""
datas.csv :
"123"; "gishi"; "gishi@mymail.com"; "456 happy st."
"345"; "tony"; "tony.veijalainen@somewhere.com"; "Espoo Finland"
"""
import csv
class excel_french(csv.Dialect):
    delimiter=';'
    quotechar='"'
    doublequote=True
    skipinitialspace=False
    lineterminator='\n'
    quoting=csv.QUOTE_MINIMAL

csv.register_dialect('excel_french', excel_french)

header=['id', 'name', 'email', 'homeaddress']
d={}
for row in csv.reader(open('datas.csv'), 'excel_french'):
    drow=dict(zip(header, row))
    d[drow['id']]=drow
print(d)

>>>
{'123': {'email': ' "gishi@mymail.com"',
         'homeaddress': ' "456 happy st."',
         'id': '123',
         'name': ' "gishi"'},
 '345': {'email': ' "tony.veijalainen@somewhere.com"',
         'homeaddress': ' "Espoo Finland"',
         'id': '345',
         'name': ' "tony"'}}
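
The leading spaces and stray quote characters in the output above come from the blank that follows each ';' in the file. One way to avoid them, sketched here under the assumption of Python 3 and with the two sample rows held in a string, is to pass skipinitialspace=True to csv.reader directly instead of registering a dialect.

```python
import csv
import io

# The two sample rows from datas.csv above, held in memory for illustration.
SAMPLE = (
    '"123"; "gishi"; "gishi@mymail.com"; "456 happy st."\n'
    '"345"; "tony"; "tony.veijalainen@somewhere.com"; "Espoo Finland"\n'
)

header = ['id', 'name', 'email', 'homeaddress']
d = {}
# skipinitialspace=True drops the blank after each ';' so that the
# following '"' is recognised as an opening quote.
reader = csv.reader(io.StringIO(SAMPLE), delimiter=';', skipinitialspace=True)
for row in reader:
    drow = dict(zip(header, row))
    d[drow['id']] = drow

print(d['123']['email'])
```

The same setting could equally be made in the dialect class by changing skipinitialspace to True.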

If the first row of the file is the header line:

"""
datas.csv :
"id"; "name"; "email"; "homeaddress"
"123"; "gishi"; "gishi@mymail.com"; "456 happy st."
"345"; "tony"; "tony.veijalainen@somewhere.com"; "Espoo Finland"
"""
import csv
class excel_french(csv.Dialect):
    delimiter=';'
    quotechar='"'
    doublequote=True
    skipinitialspace=False
    lineterminator='\n'
    quoting=csv.QUOTE_MINIMAL

csv.register_dialect('excel_french', excel_french)

d={}
for i, row in enumerate(csv.reader(open('datas.csv'), 'excel_french')):
    if i==0:
        header=row
    else:
        drow=dict(zip(header, row))
        d[drow['id']]=drow
print(d)

jice 53 Posting Whiz in Training

14 Years Ago

Have you tried the examples we gave you?
Did you read them carefully?
As far as I can tell, all the examples given do what you asked for!

lrh9 95 Posting Whiz in Training

14 Years Ago

Don't separate the user id from the entry.

import csv, contextlib

usernfo = {}

with contextlib.closing(open()) as ifile:
    for entry in csv.DictReader(ifile):
        usernfo[entry['id']] = entry

print(usernfo)

lrh9 95 Posting Whiz in Training

14 Years Ago

Don't separate the user id from the entry.

import csv, contextlib

usernfo = {}

with contextlib.closing(open('data.csv')) as ifile:
    for entry in csv.DictReader(ifile):
        usernfo[entry['id']] = entry

print(usernfo)

Fixed.
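
A self-contained variant of the same idea, as a sketch for Python 3. The sample text and its header row are assumptions for illustration; for a real file, a plain with open(...) block would replace io.StringIO, since file objects are already context managers.

```python
import csv
import io

# Hypothetical contents of data.csv; the first row names the columns.
SAMPLE = (
    "id,name,email,homeaddress\n"
    "123,gishi,gishi@mymail.com,456 happy st.\n"
    "345,tony,tony.veijalainen@somewhere.com,Espoo Finland\n"
)

usernfo = {}
for entry in csv.DictReader(io.StringIO(SAMPLE)):
    # Each entry is a dict keyed by the header names, including 'id'.
    usernfo[entry['id']] = entry

print(usernfo['345']['name'])
```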

gishi 0 Newbie Poster · Answer 1 · 2010-06-29T09:37:44+00:00

import csv

reader = csv.reader(open(r"c:\sample.dat"))

d={}
for row in reader:
    d[row[0]]=row[1:]

hi! your code worked. thanks!
But I have another question:

Since the key is the first element of the row, I want to access the other elements by field name, like this:

d[fieldname]

I can access an element by position, e.g. d[2], but then I need to know at which position in the list that element sits. It would be better to use the field name. Can you help with this?

example csv:

123, gishi, gishi@mymail.com, 456 happy st.

fields:
id, name, email, homeaddress

thanks again!

Beat_Slayer 17 Posting Pro in Training · Answer 2 · 2010-06-29T10:48:16+00:00

Can't you do it like this:

d['key']['fieldname']

gishi 0 Newbie Poster · Answer 3 · 2010-06-29T10:55:19+00:00

Can't you do like this:
d['key']['fieldname']

You can do that only if you already have field names for the CSV elements. Without them, you can only access the elements by their position in the list.
That is why I was asking for help adding headers to my CSV files.

Beat_Slayer 17 Posting Pro in Training · Answer 4 · 2010-06-29T11:08:10+00:00

If the fields are always in the same position, you can tweak the read loop a little and add the field names.

Can you post a sample csv?

gishi 0 Newbie Poster · Answer 5 · 2010-06-29T12:05:22+00:00

If the fields are in the same position you can tweak a little the read function and add the fields.
Can you post a sample csv?

example is like this:

123, gishi, gishi@mymail.com, 456 happy st.

fields:
id, name, email, homeaddress

Beat_Slayer 17 Posting Pro in Training · Answer 6 · 2010-06-29T14:30:21+00:00

Hope it helps.

import csv

d={}

for row in csv.reader(open('sample.dat')):
    d['ID %s' % row[0]] = {'name': row[1], 'email': row[2], 'homeaddress': row[3]}

print d
print

user_id = 125

print 'ID %s:' % user_id, d['ID %s' % user_id]
print
print 'ID %s' % user_id
print 'Name:', d['ID %s' % user_id]['name'],
print 'Email:', d['ID %s' % user_id]['email'],
print 'Home Address:', d['ID %s' % user_id]['homeaddress']

Output:

>>> 
{'ID 126': {'homeaddress': ' 459 happy st.', 'name': ' gishi4', 'email': ' gishi4@mymail.com'}, 'ID 124': {'homeaddress': ' 457 happy st.', 'name': ' gishi2', 'email': ' gishi2@mymail.com'}, 'ID 125': {'homeaddress': ' 458 happy st.', 'name': ' gishi3', 'email': ' gishi3@mymail.com'}, 'ID 123': {'homeaddress': ' 456 happy st.', 'name': ' gishi', 'email': ' gishi@mymail.com'}}

ID 125: {'homeaddress': ' 458 happy st.', 'name': ' gishi3', 'email': ' gishi3@mymail.com'}

ID 125
Name:  gishi3 Email:  gishi3@mymail.com Home Address:  458 happy st.

TrustyTony 888 ex-Moderator Team Colleague Featured Poster · Answer 7 · 2010-06-29T14:33:29+00:00

You can also use namedtuples; the csv module is not really needed for simple cases.

from collections import namedtuple
filein = open("sample.dat")
datadict={}

# the first line of the file holds the field names
headerline = [f.strip() for f in filein.readline().split(',')]
print(headerline)
Dataline=namedtuple('Dataline',headerline)

# every remaining line becomes one Dataline, stored under its id
for data in filein:
    data=[f.strip() for f in data.split(',')]
    print(data)
    d=Dataline(*data)

    datadict[d.id]=d

print(datadict['123'].email)
for key,value in  datadict.items():
    print('%s: %s' %(key,value))

input('Ready')

sample.dat

id, name, email, homeaddress
123, gishi, gishi@mymail.com, 456 happy st.
345, tony, tony.veijalainen@somewhere.com, Espoo Finland

Output:

['id', 'name', 'email', 'homeaddress']
['123', 'gishi', 'gishi@mymail.com', '456 happy st.']
['345', 'tony', 'tony.veijalainen@somewhere.com', 'Espoo Finland']
gishi@mymail.com
345: Dataline(id='345', name='tony', email='tony.veijalainen@somewhere.com', homeaddress='Espoo Finland')
123: Dataline(id='123', name='gishi', email='gishi@mymail.com', homeaddress='456 happy st.')
Ready
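
The two approaches can also be combined. A sketch, assuming Python 3 and the sample.dat contents above held in a string: csv.reader with skipinitialspace=True handles the splitting and the blank after each comma, and namedtuple's _make class method builds each record from a row.

```python
import csv
import io
from collections import namedtuple

# The sample.dat contents shown above, held in memory for illustration.
SAMPLE = (
    "id, name, email, homeaddress\n"
    "123, gishi, gishi@mymail.com, 456 happy st.\n"
    "345, tony, tony.veijalainen@somewhere.com, Espoo Finland\n"
)

reader = csv.reader(io.StringIO(SAMPLE), skipinitialspace=True)
# The first row supplies the field names for the namedtuple.
Dataline = namedtuple('Dataline', next(reader))

datadict = {}
for row in reader:
    record = Dataline._make(row)  # build a Dataline from the list of fields
    datadict[record.id] = record

print(datadict['123'].email)
```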

Beat_Slayer 17 Posting Pro in Training · Answer 8 · 2010-06-29T14:42:25+00:00

I had the same thought, Tony; I was just starting on a namedtuple version. Luckily I stopped by to edit my post first.

I still use the csv module because I thought some CSV files might have some kind of encoding.

TrustyTony 888 ex-Moderator Team Colleague Featured Poster · Answer 9 · 2010-06-29T14:55:45+00:00

I have bitter past experience of how compatible CSV output really is, and I wrote my own CSV module in Delphi, so I do think the csv module must be great for more complicated things (dialects are not there without reason, I can tell you). I just do not know it so well yet, and I like to hold the whole program in my mind. I had not used namedtuple in my own code, so I thought this was a good opportunity to learn how to put it to use and maybe help somebody else at the same time.

P.S. Have a look at the collections source code: the code for namedtuple is a fine example of how expressive Python is. My namedtuple version looks like a candidate for a code snippet to me. What do you think?

gishi 0 Newbie Poster · Answer 10 · 2010-06-30T17:04:10+00:00

Thanks for your answer, but what I am looking for is this:

when I enter d[address] for an id,
it should just give me the output: 456 happy st.

can you help me do this?
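
What is being asked for here can be sketched with csv.DictReader and an explicit list of field names, since the sample file has no header row. This is only an illustration under assumptions: Python 3, two rows taken from the samples and outputs in this thread held in a string, and a hypothetical helper named lookup.

```python
import csv
import io

# Two headerless rows as they appear in this thread, held in memory.
SAMPLE = (
    "123, gishi, gishi@mymail.com, 456 happy st.\n"
    "124, gishi2, gishi2@mymail.com, 457 happy st.\n"
)

# The file has no header line, so the field names are supplied by hand.
FIELDS = ['id', 'name', 'email', 'homeaddress']

records = {}
reader = csv.DictReader(io.StringIO(SAMPLE), fieldnames=FIELDS,
                        skipinitialspace=True)
for entry in reader:
    records[entry['id']] = entry

def lookup(user_id, fieldname):
    """Return one field of one record, e.g. lookup('123', 'homeaddress')."""
    return records[user_id][fieldname]

print(lookup('123', 'homeaddress'))
```

Keeping the id inside each record as well as using it as the outer key means a record is still complete when it is passed around on its own.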

Beat_Slayer 17 Posting Pro in Training · Answer 11 · 2010-06-30T17:32:22+00:00

I was seeing the collections little time ago, I used the deque on other project.

And yes, just post it, I liked it, pretty good.

gishi 0 Newbie Poster · Answer 12 · 2010-06-30T17:56:20+00:00

Thanks for your answer. However, what I need is this:

d[address]

The output would be: 456 happy st.

instead of having to write d[2]

can you help me with this again? thanks!

jice 53 Posting Whiz in Training · Answer 13 · 2010-06-30T20:49:29+00:00

Are you kidding?
It's just the example I gave 5 posts ago!

Abdulkabir_1 0 Newbie Poster · Answer 14 · 2015-11-27T13:05:01+00:00

Thank you everyone! I'm new to data science using Python.
I have a problem with pandas.read_csv:
"for index, row in iterrow:" doesn't work on Python 2.7 or Python 3.4.

However, this is my question:
I have a CSV file with 5 columns and multiple rows, as shown below:

Author_ID   Arrival   Departure   Date         Time
01202       Paris     New York    10/03/2011   10:00
02122       Beijin    New York    09/03/1999   21:00
07732       Paris     Kansas      10/03/2011   10:00

From the table above you can see that some column values match. I want to extract the rows wherever there is such an intersection: as we iterate, whenever we encounter rows with the same values in Arrival or Departure or Author_ID, and in Date and Time, we should extract and display them.
Take the Arrival column: we have Paris from 2 different authors, the Date is 10/03/2011 for both, and the Time is 10:00 for both. We want to know whether these authors are connected through their departures and arrivals at a particular date and time.

Please can anyone help?
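
One possible reading of this question is: group the rows that share the same Arrival, Date and Time, and report the groups that contain more than one author. The sketch below uses only the standard library and assumes the table has been saved as comma-separated text, since the post does not show the real delimiter. The choice of grouping key is also an assumption.

```python
import csv
import io
from collections import defaultdict

# The table from the post, rewritten as comma-separated text (assumed format).
SAMPLE = (
    "Author_ID,Arrival,Departure,Date,Time\n"
    "01202,Paris,New York,10/03/2011,10:00\n"
    "02122,Beijin,New York,09/03/1999,21:00\n"
    "07732,Paris,Kansas,10/03/2011,10:00\n"
)

groups = defaultdict(list)
for row in csv.DictReader(io.StringIO(SAMPLE)):
    # Rows with the same arrival city, date and time land in the same group.
    key = (row['Arrival'], row['Date'], row['Time'])
    groups[key].append(row['Author_ID'])

# Keep only the keys shared by at least two authors.
shared = {key: ids for key, ids in groups.items() if len(ids) > 1}
print(shared)
```

Grouping on Departure instead, or on both cities, only requires changing the tuple used as the key.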

Gribouillis 1,391 Programming Explorer Team Colleague · Answer 15 · 2015-11-27T14:56:26+00:00

@Abdulkabir_1 You say that "for index, row in iterrow:" doesn't work, but we don't know what that means because we don't have your Python code. Perhaps you could start a new discussion with your code and the error message reported by Python. It would also be a good idea to describe your program's expected output more precisely.
