parsing and matching to build a data matrix

Question

miac09 -3 Newbie Poster

15 Years Ago

Hi,

I have the following files and can match but, am having trouble with getting the output to write correctly to a data matrix format that I need and would appreciate any help with getting the code right.

File 1 has a list of numbers and Files 2, 3 and 4 have subsets of these numbers along with a column 2 of data. I need to match all the subsets in cols 1 to File 1 and then output a data matrix (file output). see below for file details.

File 1:

File 2: (files 3,4,5.... same format, different subset of numbers and data)

number        data
1000540      x
1000545      a
1000548      w
1000551      t
1001094      a
1001096      a
1002817      w

output file:

(all numbers
from file 1)       file2    file3
1000018                       w
10004                           b
1000422                       c
1000428
1000540      x                x
1000545      a
1000548
1000551
1000635
1000707
1000827
1000854......

Here's the code I've been working with:

Sps={}
for fn in os.listdir('.'):
    if fnmatch.fnmatch(fn,'*.out'):
        file_list={}
        file_list=fn
        Id=file_list[0:3]
        for line in open(fn).readlines():
            line=line.rstrip()
            x=line.split("t")
            pos=x[0]
            alle=x[1]
            SPs[pos]=alle

for line in open("All.pos",'r'):
    line=line.rstrip()
    vals=line.split("t")
    pos1=vals[0]
    Map=SPs.get(pos1)

matrix-multiplication os-x python

Edited 12 Years Ago by mike_2000_17 because: Fixed formatting

3 Contributors
2 Replies
200 Views
1 Day Discussion Span
Latest Post 15 Years Ago Latest Post by woooee

All 2 Replies

woooee 814 Nearly a Posting Maven

15 Years Ago

I would suggest using a dictionary with the individual records in file1 as the key pointing to a list of items found in the other files.

Reply to this topic

Be a part of the DaniWeb community

We're a friendly, industry-focused community of developers, IT pros, digital marketers, and technology enthusiasts meeting, networking, learning, and sharing knowledge.

TrustyTony 888 ex-Moderator Team Colleague Featured Poster · Answer 1 · 2010-06-05T02:00:33+00:00

Hi,
I have the following files and can match but, am having trouble with getting the output to write correctly to a data matrix format that I need and would appreciate any help with getting the code right-
File 1 has a list of numbers and Files 2, 3 and 4 have subsets of these numbers along with a column 2 of data. I need to match all the subsets in cols 1 to File 1 and then output a data matrix (file output). see below for file details.

File 1:

File 2: (files 3,4,5.... same format, different subset of numbers and data)

number        data
1000540      x
1000545      a
1000548      w
1000551      t
1001094      a
1001096      a
1002817      w

output file:
(all numbers
from file 1) file2 file3

1000018                       w
10004                           b
1000422                       c
1000428
1000540      x                x
1000545      a
1000548
1000551
1000635
1000707
1000827
1000854......

Here's the code I've been working with:

Sps={}
for fn in os.listdir('.'):
	if fnmatch.fnmatch(fn,'*.out'):
		file_list={}
		file_list=fn
		Id=file_list[0:3]
		for line in open(fn).readlines():
			line=line.rstrip()
			x=line.split("\t")
			pos=x[0]
			alle=x[1]
			SPs[pos]=alle
            
for line in open("All.pos",'r'):
	line=line.rstrip()
	vals=line.split("\t")
	pos1=vals[0]
	Map=SPs.get(pos1)

parsing and matching to build a data matrix

Recommended Answers Collapse Answers

All 2 Replies

Recommended Answers