List manipulation

Question

Xzarik 0 Newbie Poster

14 Years Ago

Hey all,

I've been having some problem with python - i'm pretty awful at programming but have been soldiering on with this for some time now! What I need to do is take one list and compare it to another. Both lists are presented in rows, and I need to check each row individually from list 1 against all the rows in list 2.

Example:

#List 1 contains data like this:
4
3
1
3
2
1

#List 2 contains data like this:
0, 0800, MON
1, 0830, MON
2, 0900, MON
3, 0930, MON
4, 1000, MON

I want to compare each line of list 1 against all the lines of list 2, so the result should be:

4, 1000, MON
3, 0930, MON
1, 0830, MON
3, 0930, MON
2, 0900, MON
1, 0830, MON

At the end of all this i'll have to put it in with some other data and put it all into a separate .csv for each row.

So far i've imported both sets of data into python but from here i'm a bit stumped.

Could anyone help? I'd be very appreciative!

python

4 Contributors
10 Replies
157 Views
13 Hours Discussion Span
Latest Post 14 Years Ago Latest Post by Xzarik

All 10 Replies

Beat_Slayer 17 Posting Pro in Training

14 Years Ago

Hint:

list1 = [4, 3, 1, 3, 2, 1] 

list2 = [[0, '0800', 'MON'],
         [1, '0830', 'MON'],
         [2, '0900', 'MON'],
         [3, '0930', 'MON'],
         [4, '1000', 'MON']]

for i in range(len(list1)):
    for item in list2:
        if item[0] == list1[i]:
            print item

Cheers and Happy coding

vegaseat 1,735 DaniWeb's Hypocrite

14 Years Ago

You seem to have a tuple of numeric strings. Change it to a list this way ...

data_tuple = '4', '3', '1', '3', '2', '1'
data_list = [int(item) for item in data_tuple]
print(data_list)  # [4, 3, 1, 3, 2, 1]

Beat_Slayer 17 Posting Pro in Training

14 Years Ago

That means that you have a typo, or your reference list is somenthing like:

reference_list = '1, 2, 3, 4'

and for that:

reference_list = [int(item.strip()) for item in reference_list.split(',')]

Cheers and Happy coding

Reply to this topic

Be a part of the DaniWeb community

We're a friendly, industry-focused community of developers, IT pros, digital marketers, and technology enthusiasts meeting, networking, learning, and sharing knowledge.

Xzarik 0 Newbie Poster · Answer 1 · 2010-08-26T23:59:05+00:00

Oh that's awesome and works very well, thank you so much!

I do however have a tangential problem which is that my list, for some reason, is appearing in the following format:

list 1 = '1', '2', '3', '4' (etc)

Which is a problem, because '1' != 1

Do you have a way to remove everything that isn't a number or a letter from a list? Quite soon i'll also be needing to remove brackets as well as these apostrophes!

Xzarik 0 Newbie Poster · Answer 2 · 2010-08-27T01:10:49+00:00

Sorry to be such a pain, but i'm getting this:

reference_list = [int(item) for item in reference]
ValueError: invalid literal for int() with base 10: ','

What does that mean? =/

Xzarik 0 Newbie Poster · Answer 3 · 2010-08-27T01:20:25+00:00

Aah! Problem solved on that front!

For anyone that's interested, I did the following to sort it:

import csv
reference = csv.reader (open('reference.txt', 'rb'), delimiter=',', quotechar='|')
reference_list = []
reference_list.extend(reference)

However as usual with my life, one problem down and another arises.

I've realised that the way I was getting data for my "list 1" was a complete cheat, because I was getting it like this:

for data in query_list:
    scheduled_periods.append(data[-2][-5:-3])

All well and good, but doesn't really work for three-digit or single-digit values...

The data is presented in rows like this:

LUBS5200M01/SMR 1/02<S1> wks 2-11,"((10 11 12 13 14 15 16 17 18 19) (34))",2

And the number i'm interested in is the last bracketed number... (34). Any ideas how to go about getting this?

I'm so sorry i'm asking so many questions and thank you for helping me!

Beat_Slayer 17 Posting Pro in Training · Answer 4 · 2010-08-27T01:33:37+00:00

import re

data = 'LUBS5200M01/SMR 1/02<S1> wks 2-11,"((10 11 12 13 14 15 16 17 18 19) (34))",2'

data = data.split(',')[1]

match = re.findall(r'\(\d+\)', data)

print match[0][1:-1]

Cheers and Happy coding

Xzarik 0 Newbie Poster · Answer 5 · 2010-08-27T01:40:25+00:00

It worked!

Thank you very very very much for this - it's been days since I last had more than a couple of hours sleep (told you I was terrible at coding!)

For those interested, what I did was changed the value from "[-5:-3]" to "[-6:-3]" which lets it grab 5 characters instead of 4.

for data in query_list:
    scheduled_periods.append(data[-2][-6:-3])

Which meant the second line here was needed, to cope with the extra blank space on the single-digit values. The first line took away that annoying little bracket.

scheduled_periods = [str(item.strip('(')) for item in scheduled_periods]
scheduled_periods = [str(item.strip(' (')) for item in scheduled_periods]

TrustyTony 888 ex-Moderator Team Colleague Featured Poster · Answer 6 · 2010-08-27T02:02:24+00:00

It means you are processing string, not list or tuple. Try to chance to end of line to

in reference.split.(',')]

Xzarik 0 Newbie Poster · Answer 7 · 2010-08-27T06:09:19+00:00

Xzarik 0 Newbie Poster

14 Years Ago

Thanks for all your help guys =)

List manipulation

Recommended Answers Collapse Answers

All 10 Replies

Recommended Answers