Scripts
 
Forums: » Register « |  User CP |  Games |  Calendar |  Members |  FAQs |  Sitemap |  Support | 
User Name:
Password:
Remember me
Go Back   Dev Shed ForumsWeb Site ManagementScripts

Reply
Add This Thread To:
  Del.icio.us   Digg   Google   Spurl   Blink   Furl   Simpy   Y! MyWeb 
Thread Tools Search this Thread Rate Thread Display Modes
 
Unread Dev Shed Forums Sponsor:
  #1  
Old July 21st, 2008, 07:23 AM
ahjiefreak ahjiefreak is offline
Registered User
Dev Shed Newbie (0 - 499 posts)
 
Join Date: Jul 2008
Posts: 7 ahjiefreak User rank is Just a Lowly Private (1 - 20 Reputation Level) 
Time spent in forums: 1 h 34 m 10 sec
Reputation Power: 0
How to extract elements using Awk

Hi,

I have this typical extraction problem in AWK.

I have 3 input files..

i) First one is somehow like an oracle of:-
foo 12,23,24
bla 11,34
car 35

ii)Second file is basically detailing the score for each of the second field of first file. Besides, for the first column, it is the position or rank that these score located.

1 0.345 24
2 0.231 1
3 0.220 24
4 0.1090 12

iii) Third file contain other information of the second field of first file.
Line Poll NotPoll
10 3 1
12 1 2
23 3 1
24 2 4

Initially, i try to getline for file 1 to check with the position (1st field of second file). I will only pick the most minimum position. In this case , when it scans the first line of file1, it will check position returned by 12,23 and 24 from second file.

From this second file, it will only pick the smallest position (e.g in this case, it is 1 due to "24"). This have been done. However, now i would like to link the "24" to third file. This is to enable me to match with the first field of the 3rd file. I wanted to extract further information of POll and Not Poll (which is 2 and 4 respectively).

In my below code, I only able to print the smallest position($2) but not the exact corresponding element. Please advise. Appreciate your help.

Code:
#!/usr/bin/awk -f 
{
repo = $1
split($2, search_vals, ",")
delete found

while (getline < (repo "/file2.txt")) {
	min=0;
	max=0;
	sum=0;
	scores[$1]=$2;

	for (k in search_vals) {
		if ($3 == search_vals[k]) {
	       		found[$3] = $1;
	       		flag=1;	
			break;
 		}
	}
}

asort(found)

pos=found[1]

print pos;

while (getline < (repo "/file2.txt"))  {
     if(pos==$1){
        lineno=$3;
	break;
     }
}

close(repo "/file2.txt")

print lineno;

if(flag){
score=scores[found[1]]

for(i in scores){
  if(scores[i]==score)
    newscores[i]=scores[i];
}
	min=found[1];	

   for(i in newscores){
      if(newscores[i]== score)
       {
         if(int(i)<int(min))
            min=i
         if(int(i)>int(max))
            max=i
       }
  }     

	gap=int(max)-int(min)+1;
	
	for(x=int(min); x<=int(max); x++){
   		sum+=x;
   		
	}
	if(gap==1){
	   c=min;

	}
	else{
           c=int(0.5+sum/gap);
    	}
    	
   split("", scores) 
   split("", newscores) 
} 

while (getline < (repo "/3rdfile.txt")) {
  
 
  last=$1;
}

m=c/last*100;

print repo,found[1]>"test.txt"

}


In the above code, I try to implement the above described but the line after the BOLD which is print lino. is not working. As in the bold form, I added an extra while loop to again extracting the exact number apart from the previous position they obtained. E.g( apart from returning the position, it return the corresponding 3rd field of information.) However, it doesn't return me anything.

Please advise. Appreciate alot.


Many thanks.

Reply With Quote
Reply

Viewing: Dev Shed ForumsWeb Site ManagementScripts > How to extract elements using Awk


Thread Tools  Search this Thread 
Search this Thread:

Advanced Search
Display Modes  Rate This Thread 
Rate This Thread:


Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

vB code is On
Smilies are On
[IMG] code is On
HTML code is Off
View Your Warnings | New Posts | Latest News | Latest Threads | Shoutbox
Forum Jump


Forums: » Register « |  User CP |  Games |  Calendar |  Members |  FAQs |  Sitemap |  Support | 
  
 





© 2003-2008 by Developer Shed. All rights reserved. DS Cluster 4 hosted by Hostway