Using nested list with unicode data without it showing as such

Question

fingerpainting 0 Newbie Poster

13 Years Ago

Context

I'm retrieving data from Google Analytics (via the python-googleanalytics library, as Google's API is way too complex for me right now) and putting that data into a table using an HTML library.

Python experience: very low.

Problem

I have this:

[integer]
[integer]
[integer]
[integer]
[integer]

The strings end up in a table column together and the integers in a table column. That's what I want. However, I don't want it to show any of the punctuation marks or the unicode u, just the bare strings and integers, but still in separate table columns.

What I'm trying to do is iterate over the list and the lists within the list to take all the values and encode them as ascii, but this cannot be done on integers. Maybe I should iterate over the strings and integer lists separately, but how should I separate them?

clean = []
for rows in top10:
    for x in rows:
        for i in x:
            i = i.encode('ascii','ignore')
            i = str(i)
            clean.append(i)

Gives:

Traceback (most recent call last):
  File "C:\Python26\googlescrape.py", line 31, in <module>
    i = i.encode('ascii','ignore')
AttributeError: 'int' object has no attribute 'encode'

I would be grateful if someone could help me get to the next step.



All 3 Replies

TrustyTony 888 pyMod

13 Years Ago

Did you try:

clean = []
for rows in top10:
    for x in rows:
        for i in x:
            i = str(i)
            i = i.encode('ascii','ignore')
            clean.append(i)
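
One caveat: on Python 2, str() raises UnicodeEncodeError if a string contains non-ASCII characters, so calling it first only works for ASCII-only data. A minimal sketch that checks the type instead, leaving integers alone and keeping one flat list per table row, might look like this. The sample values and the helper name to_plain are made up for illustration, and the final str() call is only there so the same lines run on both Python 2 and Python 3.

```python
# Hypothetical sample data shaped like the list in the question.
top10 = [
    [[u'abcde', u'caf\xe9'], [1]],
    [[u'fghij', u'klmno'], [2]],
]

def to_plain(value):
    """Return integers unchanged and text with non-ASCII characters dropped."""
    if isinstance(value, int):
        # Integers have no encode() method, so pass them through untouched.
        return value
    ascii_only = value.encode('ascii', 'ignore').decode('ascii')
    # Only ASCII remains, so str() is safe here even on Python 2.
    return str(ascii_only)

clean = []
for row in top10:
    # One flat list per row: the strings first, then the integer.
    clean.append([to_plain(value) for group in row for value in group])

for row in clean:
    print(row)
```

Keeping one list per row, rather than one long flat list, preserves which strings belong with which integer when the table is built.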


Beat_Slayer 17 Posting Pro in Training · Answer 1 · 2010-08-24T20:20:12+00:00

Why not?

top10 = [
        [[u'abcde', u'fghij'], [1]],
        [[u'abcde', u'fghij'], [2]],
        [[u'abcde', u'fghij'], [3]],
        [[u'abcde', u'fghij'], [4]],
        [[u'abcde', u'fghij'], [5]]]

top10 = [[[str(a), str(b)], x] for [a, b], x in top10]

for item in top10:
    print item

Output:
[['abcde', 'fghij'], [1]]
[['abcde', 'fghij'], [2]]
[['abcde', 'fghij'], [3]]
[['abcde', 'fghij'], [4]]
[['abcde', 'fghij'], [5]]

Cheers and Happy coding
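
The comprehension above unpacks exactly two strings per row. If the number of strings varies, a nested comprehension is one way to generalise it. This is a sketch with made-up sample data and a hypothetical result name, plain; like the original, it assumes the text is ASCII-only, since str() on Python 2 fails on other characters.

```python
# Hypothetical rows with a varying number of strings.
top10 = [
    [[u'abcde', u'fghij', u'klmno'], [1]],
    [[u'pqrst'], [2]],
]

# Convert every string in the first sub-list; keep the integer list as it is.
plain = [[[str(s) for s in strings], numbers] for strings, numbers in top10]

for item in plain:
    print(item)
```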

fingerpainting 0 Newbie Poster · Answer 2 · 2010-08-24T21:35:43+00:00

Thanks for your help tonyjv and beat slayer.

When I do the str() first, I get each character of the strings in a separate table column, and each string and integer in a separate table row. However, the list 'clean' does return the values I want, so I'll try and tweak the HTML output.

When I use the list comprehension, the initial output is perfectly represented without unicode. Here too I will probably just have to tweak the HTML output so it doesn't show the list brackets.

Thanks! Will mark this as solved.
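
For the HTML tweak: the brackets, quotes and u prefixes appear when a whole list is converted to text, because Python 2 then shows each element's repr. Formatting every value on its own avoids them. The thread does not say which HTML library is in use, so the sketch below builds the markup by hand; html_table and the sample rows are hypothetical, and cell values are inserted without HTML escaping.

```python
def html_table(rows):
    """Build a minimal HTML table from rows shaped like [[strings], [integers]]."""
    lines = ['<table>']
    for row in rows:
        # Flatten [[u'a', u'b'], [1]] into [u'a', u'b', 1].
        cells = [value for group in row for value in group]
        # '%s' formats each value on its own, so no brackets or u prefix appear.
        tds = ''.join('<td>%s</td>' % cell for cell in cells)
        lines.append('<tr>%s</tr>' % tds)
    lines.append('</table>')
    return '\n'.join(lines)

# Hypothetical sample rows.
top10 = [
    [[u'abcde', u'fghij'], [1]],
    [[u'abcde', u'fghij'], [2]],
]
print(html_table(top10))
```

Each string and each integer ends up in its own cell, with one table row per entry in the list.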
