Perl Programming
 
Forums: » Register « |  User CP |  Games |  Calendar |  Members |  FAQs |  Sitemap |  Support | 
User Name:
Password:
Remember me
Go Back   Dev Shed ForumsProgramming LanguagesPerl Programming

Reply
Add This Thread To:
  Del.icio.us   Digg   Google   Spurl   Blink   Furl   Simpy   Y! MyWeb 
Thread Tools Search this Thread Rate Thread Display Modes
 
Unread Dev Shed Forums Sponsor:
Stop making mediocre tutorials.The best tutorials are video! Camtasia Studio makes it easy to create engaging, buzz-building screen videos at any size, in any popular format. Download the free trial!
  #1  
Old May 3rd, 2008, 12:47 PM
PerlNuBe PerlNuBe is offline
Registered User
Dev Shed Newbie (0 - 499 posts)
 
Join Date: Sep 2004
Posts: 23 PerlNuBe User rank is Just a Lowly Private (1 - 20 Reputation Level) 
Time spent in forums: 23 h 32 m
Reputation Power: 0
I can't crawl on a web page which is using POST

How can I capture the html code from a public / free website which is using POST in the form field?

If I right click and "view source", I can see the html. How do I save this and /or is there a module in cpan?

Thanks

Reply With Quote
  #2  
Old May 3rd, 2008, 02:27 PM
Axweildr's Avatar
Axweildr Axweildr is offline
CPAN medic ...
Click here for more information.
 
Join Date: Mar 2003
Location: Location: Location:
Posts: 10,905 Axweildr User rank is General 20th Grade (Above 100000 Reputation Level)Axweildr User rank is General 20th Grade (Above 100000 Reputation Level)Axweildr User rank is General 20th Grade (Above 100000 Reputation Level)Axweildr User rank is General 20th Grade (Above 100000 Reputation Level)Axweildr User rank is General 20th Grade (Above 100000 Reputation Level)Axweildr User rank is General 20th Grade (Above 100000 Reputation Level)Axweildr User rank is General 20th Grade (Above 100000 Reputation Level)Axweildr User rank is General 20th Grade (Above 100000 Reputation Level)Axweildr User rank is General 20th Grade (Above 100000 Reputation Level)Axweildr User rank is General 20th Grade (Above 100000 Reputation Level)Axweildr User rank is General 20th Grade (Above 100000 Reputation Level)Axweildr User rank is General 20th Grade (Above 100000 Reputation Level)Axweildr User rank is General 20th Grade (Above 100000 Reputation Level)Axweildr User rank is General 20th Grade (Above 100000 Reputation Level)Axweildr User rank is General 20th Grade (Above 100000 Reputation Level)Axweildr User rank is General 20th Grade (Above 100000 Reputation Level)  Folding Points: 119844 Folding Title: Super Ultimate Folder - Level 1Folding Points: 119844 Folding Title: Super Ultimate Folder - Level 1Folding Points: 119844 Folding Title: Super Ultimate Folder - Level 1Folding Points: 119844 Folding Title: Super Ultimate Folder - Level 1Folding Points: 119844 Folding Title: Super Ultimate Folder - Level 1Folding Points: 119844 Folding Title: Super Ultimate Folder - Level 1
Time spent in forums: 3 Months 3 Weeks 6 Days 3 h 56 m 58 sec
Reputation Power: 2304
Send a message via Google Talk to Axweildr
Orkut
I think there could be something missing from your question, what exactly is it you're trying to do, if it's just sucking down the html, you can simply use wget, but I don't think that's what you're after ...
__________________
--Ax
without exception, there is no rule ...
The great thing about Object Oriented code is that it can make small, simple problems look like large, complex ones


09 F9 11 02
9D 74 E3 5B
D8 41 56 C5
63 56 88 C0
Some people, when confronted with a problem, think "I know, I'll use regular expressions." Now they have two problems.
-- Jamie Zawinski

Reply With Quote
  #3  
Old May 3rd, 2008, 04:47 PM
PerlNuBe PerlNuBe is offline
Registered User
Dev Shed Newbie (0 - 499 posts)
 
Join Date: Sep 2004
Posts: 23 PerlNuBe User rank is Just a Lowly Private (1 - 20 Reputation Level) 
Time spent in forums: 23 h 32 m
Reputation Power: 0
I tried wget from the command line and the entire content of the html page did not display. It's populating it from a database...I think.

The complete page loads into notebook when I right click and view souce.


Quote:
Originally Posted by Axweildr
I think there could be something missing from your question, what exactly is it you're trying to do, if it's just sucking down the html, you can simply use wget, but I don't think that's what you're after ...

Reply With Quote
  #4  
Old May 3rd, 2008, 07:06 PM
keath's Avatar
keath keath is offline
!~ /m$/
Dev Shed Frequenter (2500 - 2999 posts)
 
Join Date: May 2004
Location: Leawood, Kansas
Posts: 2,513 keath User rank is Colonel (50000 - 60000 Reputation Level)keath User rank is Colonel (50000 - 60000 Reputation Level)keath User rank is Colonel (50000 - 60000 Reputation Level)keath User rank is Colonel (50000 - 60000 Reputation Level)keath User rank is Colonel (50000 - 60000 Reputation Level)keath User rank is Colonel (50000 - 60000 Reputation Level)keath User rank is Colonel (50000 - 60000 Reputation Level)keath User rank is Colonel (50000 - 60000 Reputation Level)keath User rank is Colonel (50000 - 60000 Reputation Level)keath User rank is Colonel (50000 - 60000 Reputation Level)keath User rank is Colonel (50000 - 60000 Reputation Level)keath User rank is Colonel (50000 - 60000 Reputation Level) 
Time spent in forums: 1 Week 4 Days 9 h 9 m
Reputation Power: 527
I'm still not sure your question is stated properly. It sounds like you are asking for just the HTML for the form, but since you mentioned POST in the subject, I think you want to post data and get the resulting page.

Either way, you need to use LWP or maybe Mechanize.

There are some examples here of posting with LWP. Come back with specific problems.

Edit: Here's one I remember: previous thread

Last edited by keath : May 3rd, 2008 at 07:12 PM.

Reply With Quote
  #5  
Old May 4th, 2008, 07:03 PM
stonyt10's Avatar
stonyt10 stonyt10 is offline
www guy
Dev Shed Newbie (0 - 499 posts)
 
Join Date: Aug 2006
Location: Columbus, Indiana
Posts: 176 stonyt10 User rank is First Lieutenant (10000 - 20000 Reputation Level)stonyt10 User rank is First Lieutenant (10000 - 20000 Reputation Level)stonyt10 User rank is First Lieutenant (10000 - 20000 Reputation Level)stonyt10 User rank is First Lieutenant (10000 - 20000 Reputation Level)stonyt10 User rank is First Lieutenant (10000 - 20000 Reputation Level)stonyt10 User rank is First Lieutenant (10000 - 20000 Reputation Level)stonyt10 User rank is First Lieutenant (10000 - 20000 Reputation Level)stonyt10 User rank is First Lieutenant (10000 - 20000 Reputation Level) 
Time spent in forums: 2 Days 11 h 29 m 27 sec
Reputation Power: 119
Code:
I tried wget from the command line and the entire content of the html page did not display. It's populating it from a database...I think.

The complete page loads into notebook when I right click and view souce.



In my experience, if the complete source code appears in your browser then the complete source code will be in your variable with Axweildr's method. You might need to deal with your line breaks before you output it since you will also have those in your variable.
__________________
print qq|Here I am\n| unless -e 'mySocialLife';

Reply With Quote
Reply

Viewing: Dev Shed ForumsProgramming LanguagesPerl Programming > I can't crawl on a web page which is using POST


Thread Tools  Search this Thread 
Search this Thread:

Advanced Search
Display Modes  Rate This Thread 
Rate This Thread:


Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

vB code is On
Smilies are On
[IMG] code is On
HTML code is Off
View Your Warnings | New Posts | Latest News | Latest Threads | Shoutbox
Forum Jump

 Free IT White Papers!
 
Accelerating Trading Partner Performance
One in five. That's how many partner transactions have at least one error. That is an amazing statistic, particularly given the extraordinary leaps in innovation across the global supply chain during the past two decades. Download this white paper to learn more.

 
Competing on Analytics
This Tech Analysis is designed to help identify characteristics shared by analytics competitors, and includes information about 32 organizations that have made a commitment to quantitative, fact-based analysis.

 
Cost Effective Scaling with Virtualization and Coyote Point Systems
An overview of the industry trend toward virtualization, how server consolidation has increased the importance of application uptime and the steps being taken to integrate load balancing technology with virtualized servers.

 
Five Checkpoints to Implementing IP Telephony
Implementation planning for IP PBX software and IP telephony has become vital as businesses replace discontinued legacy PBX phone systems. This informative whitepaper outlines five "checkpoints" for any implementation plan that will help make IP communications a successful proposition.

 
Hosted Email Security: Staying Ahead of New Threats
In the last two years, email has become a fierce battleground between the nefarious forces of spam and malware, and the heroes of messaging protection. The spam volumes increased alarmingly every month, bringing clever new forms of phishing and virus propagation attacks.

 

Forums: » Register « |  User CP |  Games |  Calendar |  Members |  FAQs |  Sitemap |  Support | 
  
 





© 2003-2008 by Developer Shed. All rights reserved. DS Cluster 5 hosted by Hostway