UNIX Help
 
  #1  
July 1st, 2004, 03:41 AM
huckleberry
Help with AWK

Hi, I need urgent help recovering some data. Basically, I have a log file containing data such as EMAIL_ADDRESS = [test@test.com],NAME = [test], etc. This line of data can be anywhere in the file, and in a thousand lines of data it may occur only a handful of times.
Now I need to read through this file and extract all the email addresses. I want to extract the string EMAIL_ADDRESS = [test@test.com]. I have tried using awk to do this, but I am not doing something quite right.
BEGIN { }
match($0, pattern) {

print substr($0, RSTART, RLENGTH);
}
pattern=/EMAIL_ADDRESS.*=.*\[.*\]/

END { }
Can anyone help, please?

  #2  
July 1st, 2004, 03:59 AM
stevengs
Phew, friend, you've bitten off a huge chunk. Have you ever looked at the email address specification (RFC 822, ftp://ftp.rfc-editor.org/in-notes/rfc822.txt)? The specification is extremely complicated, and writing a regex for it is nearly impossible for us mere mortals. As an example, the Perl regex for email addresses is over 60 lines long. To give you an idea, here is a perfectly legal email address:

Jeffy <"That Tall Guy"@ora.com (this address no longer active)>

Imagine the fun of parsing that! And this example uses only a tiny portion of the freedoms in the specification.

I would seriously recommend using existing libraries if at all possible. I mentioned Perl, but there are implementations in nearly every language.

Good luck.

-Steven
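
For the narrower task in the first post, full RFC 822 parsing may not be needed, since the log already delimits each address with square brackets. As far as I can tell, the posted script fails because `pattern` is still empty when the `match` rule runs, and the line `pattern=/.../` is treated by awk as a rule of its own rather than as a setup step. Below is a minimal sketch that puts the regex directly in `match` and loops so that several entries on one line are all printed. The function name `extract_emails` is made up for illustration, and the sketch assumes the bracketed value never contains a `]`.

```shell
# Hypothetical helper: print every "EMAIL_ADDRESS = [...]" substring
# found in the given files (or on standard input if none are given).
extract_emails() {
  awk '
  {
    line = $0
    # match() sets RSTART and RLENGTH for the leftmost match.
    # [^]]* means "any run of characters other than ]", so the match
    # stops at the first closing bracket instead of the last one.
    while (match(line, /EMAIL_ADDRESS *= *\[[^]]*\]/)) {
      print substr(line, RSTART, RLENGTH)
      # Continue searching in the remainder of the line.
      line = substr(line, RSTART + RLENGTH)
    }
  }' "$@"
}
```

It could be called as `extract_emails logfile`, where `logfile` stands in for the real log. The original `.*` pieces are greedy, so on a line like the sample they would run on to the last `]` and swallow `NAME = [test]` too, which is why the sketch uses ` *` and `[^]]*` instead.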

© 2003-2008 by Developer Shed. All rights reserved. DS Cluster 4 hosted by Hostway