create matrix

Question

sofia85 0 Junior Poster in Training

13 Years Ago

Hi,
I want to create a matrix containing 20 rows and 2 cols. I know how to do this, but I have these two files; number and amount (each file contains 1 col and 20 rows) and I don't know how to add these files into this matrix that I'm creating like this,

import numpy as ny
M = ny.zeros((20,2),float)
M[:,0] = number
M=[:,1] = amount

Do I need a for loop to be able to add all the information into all the rows in the matrix? Because now I get this error message saying ValueError: setting an array element with a sequence. Also it complains about my files is not number, e.g. number-file rows looks like this

python

Edited 13 Years Ago by sofia85 because: n/a

2 Contributors
16 Replies
156 Views
9 Hours Discussion Span
Latest Post 13 Years Ago Latest Post by Gribouillis

All 16 Replies

Gribouillis 1,391 Programming Explorer

13 Years Ago

The following works for me

import numpy as ny
data = list(ny.loadtxt(filename) for filename in ("data1.txt", "data2.txt"))
result = ny.array(zip(*data))
print result
print result.shape

If there is a numpy way to do the zip(*data), it would probably be faster. You can also time itertools.izip() for comparison.

Gribouillis 1,391 Programming Explorer

13 Years Ago

If you want to load the files separately, you only need ny.loadtxt("filename") . But you shouldn't worry too much about performance, because 20 lines is not many lines, unless it's called 1000000 times.

Gribouillis 1,391 Programming Explorer

13 Years Ago

Actually its like 1000 lines, but maybe it still works for that purpose.

It should be OK. If you want to be sure, learn to use module timeit (but don't read the file 1000000 times, it's not good for your hard disk).

Edited 13 Years Ago by Gribouillis because: n/a

Gribouillis 1,391 Programming Explorer

13 Years Ago

I found a faster way than zip (a numpyish way)

data = tuple(ny.loadtxt(filename) for filename in ("data1.txt", "data2.txt"))
    result = ny.column_stack(data)
    print result
    print result.shape

""" my output -->
[[  45.56564    45.56564 ]
 [ 564.8484    564.8484  ]
 [ 114.25477   114.25477 ]
 ..., 
 [ 114.25477   114.25477 ]
 [   1.325588    1.325588]
 [   2.36547     2.36547 ]]
(1000, 2)
"""

the ny.column_stack(data) is 166 times faster than array(zip(*data)), with 0.015 milliseconds.

Edited 13 Years Ago by Gribouillis because: n/a

Reply to this topic

Be a part of the DaniWeb community

We're a friendly, industry-focused community of developers, IT pros, digital marketers, and technology enthusiasts meeting, networking, learning, and sharing knowledge.

sofia85 0 Junior Poster in Training · Answer 1 · 2011-11-09T23:26:34+00:00

So what if I want to write these files; number and amount in two different functions in the same porgram and then just return res1 and res2, And then 'call' on these two results in the 'main program'. Would I be able to use it like I did and skip the zip?

sofia85 0 Junior Poster in Training · Answer 2 · 2011-11-09T23:33:06+00:00

Actually its like 1000 lines, but maybe it still works for that purpose.

sofia85 0 Junior Poster in Training · Answer 3 · 2011-11-09T23:39:31+00:00

Do I need a for loop? I get error TypeError: zip argument #1 must support iteration

Gribouillis 1,391 Programming Explorer Team Colleague · Answer 4 · 2011-11-09T23:41:00+00:00

Do I need a for loop? I get error TypeError: zip argument #1 must support iteration

Did you write the same code that I wrote above ? because for me it runs without errors.

sofia85 0 Junior Poster in Training · Answer 5 · 2011-11-09T23:47:11+00:00

sofia85 0 Junior Poster in Training

13 Years Ago

yes, it doesn't work. I still get error message.

sofia85 0 Junior Poster in Training · Answer 6 · 2011-11-09T23:50:59+00:00

It seems as it can't convert str to float and my outputfiles looks something like this .

Gribouillis 1,391 Programming Explorer Team Colleague · Answer 7 · 2011-11-09T23:52:13+00:00

Gribouillis 1,391 Programming Explorer

13 Years Ago

yes, it doesn't work. I still get error message.

Does

print ny.loadtxt("data1.txt")

print a numpy array, as it should ? Please post code (also make sure to read the documentation for loadtxt())
By the way ny.array(zip(*data)) takes 2.49 milliseconds on my computer with python 2.6 and 2 arrays of size 1000.
Here are my files data1.txt and data2.txt

data1.txt (8.79 KB)

data2.txt (8.79 KB)

Edited 13 Years Ago by Gribouillis because: n/a

sofia85 0 Junior Poster in Training · Answer 8 · 2011-11-10T00:42:52+00:00

sofia85 0 Junior Poster in Training

13 Years Ago

It works now! Thank you!

sofia85 0 Junior Poster in Training · Answer 9 · 2011-11-10T00:47:37+00:00

What if I want to add this column1 together with column2, i.e. col1 + col2, into a column 3 in the same matrix. Could I accomplish that with this kind of code?

Gribouillis 1,391 Programming Explorer Team Colleague · Answer 10 · 2011-11-10T01:49:18+00:00

Sure

data = tuple(ny.loadtxt(filename) for filename in ("data1.txt", "data2.txt"))
    data += (data[0] + data[1],)
    result = ny.column_stack(data)

sofia85 0 Junior Poster in Training · Answer 11 · 2011-11-10T01:53:02+00:00

sofia85 0 Junior Poster in Training

13 Years Ago

Thank you! Now it works perfectly!

Gribouillis 1,391 Programming Explorer Team Colleague · Answer 12 · 2011-11-10T02:03:27+00:00

Gribouillis 1,391 Programming Explorer

13 Years Ago

Thank you! Now it works perfectly!

You are welcome.

create matrix

Recommended Answers Collapse Answers

All 16 Replies

Recommended Answers