Python Programming
 
  #1  
rbdeforest, March 10th, 2013, 07:39 PM
Index out of range problems

Hi everyone,

I'm rather new to Python programming and am teaching myself via YouTube. One of the tutorials showed how to fetch content from websites, which is called screen scraping. The code from the tutorial works fine with its example RSS feed, but when I try it on another feed it fails with:
"print findPatTitle [i] IndexError: list index out of range"

Here's the code that results in the error:
Code:
from bs4 import BeautifulSoup
from urllib import urlopen
import re

webpage = urlopen('http://www.kayak.com/h/rss/deals').read()
patFinderTitle = re.compile('<title>(.*)</title>')
findPatTitle = re.findall(patFinderTitle,webpage)

listIterator = []
listIterator[:] = range(2,10)

for i in listIterator:
    print findPatTitle [i]


I would simply like to know why the feed from the tutorial works while the one I'm trying does not.

Thanks in advance!

  #2  
b49P23TIvg, March 10th, 2013, 09:30 PM
I think you need a better tutorial. The program is too complicated.

Code:
from urllib import urlopen
import re

webpage = urlopen('http://www.kayak.com/h/rss/deals').read()
patFinderTitle = re.compile('<title>(.*)</title>')

titles = patFinderTitle.findall(webpage)

# Print every title together with its index
for (i, title) in enumerate(titles):
    print('title %2d: %s' % (i, title))

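Run this program and you'll see there are 8 titles, at indexes 0 through 7. With the crazy list iterator, which runs over range(2,10), that is, indexes 2 through 9, you were trying to access the unavailable indexes 8 and 9. That appropriately raises IndexError as soon as the loop reaches index 8.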
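
To see the same thing without hitting the network, here is a minimal sketch. It assumes a made-up feed string with 8 titles; sample_feed, safe_titles and first_bad_index are hypothetical names for illustration, not from the tutorial. It relies on the fact that a slice simply stops at the end of the list instead of raising IndexError.

```python
import re

# Hypothetical stand-in for the downloaded feed (assumption: 8 titles,
# one per line; the real feed's contents are not shown in this thread).
sample_feed = "\n".join("<title>Deal %d</title>" % n for n in range(8))

title_pattern = re.compile(r"<title>(.*)</title>")
titles = title_pattern.findall(sample_feed)


def safe_titles(found, start=2, stop=10):
    """Return found[start:stop]; slicing past the end does not raise."""
    return found[start:stop]


def first_bad_index(found, indexes):
    """Return the first index that found[i] would reject, or None."""
    for i in indexes:
        if i >= len(found):
            return i
    return None


# Prints only the titles that exist from index 2 onward.
for title in safe_titles(titles):
    print(title)
```

Calling first_bad_index(titles, range(2, 10)) on this sample points at the index where the original loop would have stopped. If you really need exactly the indexes 2 through 9, check len(titles) first.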

  #3  
rbdeforest, March 11th, 2013, 11:02 PM
Perfect

Thanks a lot!
