hopefully easy to solve this simple question

Question

doctorjo5 0 Light Poster

14 Years Ago

I know that this is an easy one but it is hard for me to figure out.
I need to make a program that can:
* Open the file and read through all of the lines in the file. (got this)
* If a line starts with "Subject:", skip the line. (got this)
* For all the other lines, split the strings into a list of words
using the white space and find the first word. After you have
found the first word, split that string again into a *new* list of
words using the "/" character. (got this I think with the words=line.split())
* Look at the second word of the *new* list. If it does not contain
"trunk" or "branches", skip the line. (stuck on this)
* For the lines we still have left, look at the first word of the
*new* list, and use that word to index your dictionary of running
counts for each module. (stuck on this)
* At the end of the program, print out your dictionary of counts

What I don't understand is how to split the line a second time to look at the second word that the above guideline is referring to. That is stopping me from even beginning the search for the words 'branches' and 'trunk'. Here is the code that I have written so far:

file = raw_input('Enter a file name: ')
try:
    fhand = open(file)
except:
    'file cannot be opened:', file

counts=dict()
for line in fhand:
    if line.startswith('Subject:'):
        continue
    else:
        words=line.split()
        modified=words[0]
        if len(words) == 0 : continue
        modified=modified.split('/')
        x=modified[1]
        if len(modified) == 0 : continue
        if x not in counts:
            counts[x]=1
        else:
            counts[x] = counts[x]+1

lst = list()
for val, key in counts.items():
    lst.append( (val, key) )

lst.sort()

for val, key in lst[:] :
    print val, key

I would greatly appreciate any advice. Thank you in advance!

python


woooee 814 Nearly a Posting Maven

14 Years Ago

This line does nothing

if len(modified) == 0 : continue

Look at the second word of the *new* list. If it does not contain
"trunk" or "branches", skip the line. (stuck on this)

Use the "in" keyword.

**---  will return a positive for words like "strunk"
found = False
for word in ["trunk", "branches"] :
    if word in words[1]:
        found = True
if not found:
    print "processing this"
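
Applied to the original task, the test would go on the second element of the list produced by the second split rather than on words[1]. Here is a minimal sketch of that idea in Python 3 syntax. The sample line is made up (the real file format is not shown in the thread), and the names has_keyword, sample_line and parts are only illustrative:

```python
def has_keyword(text, keywords=("trunk", "branches")):
    """Return True if any keyword occurs anywhere inside text (substring match)."""
    return any(keyword in text for keyword in keywords)

# Hypothetical sample line; the real file format is not shown in the thread.
sample_line = "core/trunk/src/main.py modified by someone"

words = sample_line.split()    # first split: on whitespace
parts = words[0].split("/")    # second split: the first word, on "/"

print(parts)                   # the *new* list of words
print(has_keyword(parts[1]))   # test the second word of the new list
print(has_keyword("strunk"))   # substring matching also accepts this
```

Because this is a substring test, a word such as "strunk" passes as well, which may or may not be what the assignment wants.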


doctorjo5 commented: this is helpful +0

The_Kernel 33 Light Poster

14 Years Ago

Use the "in" keyword.

**---  will return a positive for words like "strunk"
found = False
for word in ["trunk", "branches"] :
    if word in words[1]:
        found = True
if not found:
    print "processing this"

Wouldn't it make sense to do this the opposite way? i.e.

if words[1] in ["trunk", "branches"]:
    found = True
else:
    found = False
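
Since "in" already evaluates to True or False, the exact-match version above could probably be collapsed into a single expression. A small sketch in Python 3 syntax, where is_tracked is a hypothetical helper name and not something from the thread:

```python
def is_tracked(name):
    """Exact match: True only when name is exactly 'trunk' or 'branches'."""
    return name in ("trunk", "branches")

# Compare a few candidate words, including a near miss.
for candidate in ["trunk", "branches", "strunk", "tags"]:
    print(candidate, is_tracked(candidate))
```

Unlike the substring test, this rejects "strunk", because the whole word has to equal one of the entries.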


doctorjo5 0 Light Poster · Answer 1 · 2010-02-24T07:24:29+00:00

Also, if you can't offer any advice but might know a place where I could read information that explicitly deals with a problem like this, I would really appreciate that too. Thanks again.

redyugi 5 Junior Poster in Training · Answer 2 · 2010-02-24T07:26:35+00:00

I only have a minute, so I can only offer a bit of advice right now.
Don't use "file" as a variable name, because it is a built-in name (the file type in Python 2). Use something like "file1" instead.
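
A rough sketch of how the file-opening part might look with a name that does not shadow the built-in, written in Python 3 syntax. The function read_lines and the throwaway temp file are assumptions made for illustration only; they are not part of the original program:

```python
import os
import tempfile

def read_lines(file_name):
    """Return the lines of file_name, or None if it cannot be opened."""
    try:
        with open(file_name) as fhand:
            return fhand.readlines()
    except IOError:
        print("file cannot be opened:", file_name)
        return None

# Demonstration with a throwaway file so the sketch runs on its own.
with tempfile.NamedTemporaryFile("w", suffix=".txt", delete=False) as tmp:
    tmp.write("Subject: hello\ncore/trunk/a.py\n")
    temp_path = tmp.name

print(read_lines(temp_path))
print(read_lines(temp_path + ".missing"))
os.remove(temp_path)
```

Catching IOError instead of using a bare except keeps unrelated errors visible, and the with block closes the file automatically.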

doctorjo5 0 Light Poster · Answer 3 · 2010-02-24T21:14:21+00:00

woooee's help is really good, but I don't see where he is inserting the "in" code. I don't see anywhere in my code where I can use a loop that starts with

**--- will return a positive for words like "strunk"
found = False

I am not sure what **--- does and I don't understand where he got the word "strunk". Are these lines that he commented out but I'm not seeing the '#' symbol for some reason?

Thank you.

woooee 814 Nearly a Posting Maven · Answer 4 · 2010-02-25T00:20:59+00:00

It depends on whether you want to find sub-words or not. The simple example below shows the difference.

words_master = [ "the", "there", "theres"]

##--- look for word in list
word = "there"
if word in words_master:
    print(word, "found")
else:
    print(word, "Not found")

##--- compare each word in the list to the original word
print("-"*30)
for m_word in words_master:
    if m_word in word:
        print("%s found in %s" % (m_word, word))
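
Putting the pieces from this thread together, the counting loop from the original question could look roughly like the sketch below (Python 3 syntax). It uses the exact-match test; swap in a substring test if sub-words should count. The function name count_modules and the sample lines are invented for illustration, since the real input file is not shown:

```python
def count_modules(lines):
    """Count lines per module, keeping only paths whose second part is trunk/branches."""
    counts = {}
    for line in lines:
        if line.startswith("Subject:"):
            continue
        words = line.split()
        if not words:              # blank line: nothing to split
            continue
        parts = words[0].split("/")
        if len(parts) < 2:         # no "/" in the first word
            continue
        if parts[1] not in ("trunk", "branches"):
            continue
        module = parts[0]          # first word of the new list is the key
        counts[module] = counts.get(module, 0) + 1
    return counts

# Hypothetical input lines standing in for the real file.
sample_lines = [
    "Subject: weekly commit summary",
    "core/trunk/src/main.py",
    "core/branches/dev/main.py",
    "docs/tags/v1/readme.txt",
    "ui/trunk/app.py",
    "",
]

for module, count in sorted(count_modules(sample_lines).items()):
    print(module, count)
```

Note that the length checks come before the indexing, so an empty line or a first word without "/" is skipped instead of raising an IndexError.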