ColdFusion Development
 
Forums: » Register « |  User CP |  Games |  Calendar |  Members |  FAQs |  Sitemap |  Support | 
User Name:
Password:
Remember me
Go Back   Dev Shed ForumsProgramming Languages - MoreColdFusion Development

Reply
Add This Thread To:
  Del.icio.us   Digg   Google   Spurl   Blink   Furl   Simpy   Y! MyWeb 
Thread Tools Search this Thread Rate Thread Display Modes
 
Unread Dev Shed Forums Sponsor:
  #1  
Old May 12th, 2005, 07:41 AM
rexx rexx is offline
Registered User
Dev Shed Newbie (0 - 499 posts)
 
Join Date: May 2005
Posts: 6 rexx User rank is Just a Lowly Private (1 - 20 Reputation Level) 
Time spent in forums: 43 m 50 sec
Reputation Power: 0
Verity vs. Adobe PDF

Hi,

I set up a Verity search on my site to comb files such as PowerPoint, PDF, Word Docs, etc.

Everything works great, except my summaries for PDF files have no spaces. They just come in as one solid string 500 characters long. They appear to break at the end of a sentence, but I can't tell if that's just because there's a period at the end, and of course the string breaks at a hyphen.

I've searched all over Google and can't find anything. Has anybody else had this happen, or have any ideas on how to fix it? It looks so sloppy!

Thanks

Reply With Quote
  #2  
Old May 12th, 2005, 08:39 AM
kiteless kiteless is offline
Moderator
Dev Shed Expert (3500 - 3999 posts)
 
Join Date: Jun 2002
Location: Raleigh, NC
Posts: 3,689 kiteless User rank is Sergeant Major (2000 - 5000 Reputation Level)kiteless User rank is Sergeant Major (2000 - 5000 Reputation Level)kiteless User rank is Sergeant Major (2000 - 5000 Reputation Level)kiteless User rank is Sergeant Major (2000 - 5000 Reputation Level)kiteless User rank is Sergeant Major (2000 - 5000 Reputation Level)kiteless User rank is Sergeant Major (2000 - 5000 Reputation Level) 
Time spent in forums: 1 Week 4 Days 16 h 33 m 51 sec
Reputation Power: 53
Vertiy will index PDFs and the summaries do show up fine, so I'm not sure what the problem is. Are you by chance using some kind of whitespace suppression servlet or program that compresses the returned HTTP stream?
__________________
Ask if you have a question, but also help answer questions that you have knowledge of! Thanks, Brian.
How to Post a Question in the Forums

Reply With Quote
  #3  
Old May 12th, 2005, 12:21 PM
rexx rexx is offline
Registered User
Dev Shed Newbie (0 - 499 posts)
 
Join Date: May 2005
Posts: 6 rexx User rank is Just a Lowly Private (1 - 20 Reputation Level) 
Time spent in forums: 43 m 50 sec
Reputation Power: 0
Quote:
Originally Posted by kiteless
Vertiy will index PDFs and the summaries do show up fine, so I'm not sure what the problem is. Are you by chance using some kind of whitespace suppression servlet or program that compresses the returned HTTP stream?



None that I'm aware of. All the summaries for PowerPoint and Word files come out fine.


EDIT: I should also mention that I am using CFMX 6.1.

Reply With Quote
  #4  
Old May 12th, 2005, 12:38 PM
kiteless kiteless is offline
Moderator
Dev Shed Expert (3500 - 3999 posts)
 
Join Date: Jun 2002
Location: Raleigh, NC
Posts: 3,689 kiteless User rank is Sergeant Major (2000 - 5000 Reputation Level)kiteless User rank is Sergeant Major (2000 - 5000 Reputation Level)kiteless User rank is Sergeant Major (2000 - 5000 Reputation Level)kiteless User rank is Sergeant Major (2000 - 5000 Reputation Level)kiteless User rank is Sergeant Major (2000 - 5000 Reputation Level)kiteless User rank is Sergeant Major (2000 - 5000 Reputation Level) 
Time spent in forums: 1 Week 4 Days 16 h 33 m 51 sec
Reputation Power: 53
6.1 should do this fine. Is it all the PDF files? Are they possibly old PDF versions?

Reply With Quote
  #5  
Old May 13th, 2005, 01:56 AM
rexx rexx is offline
Registered User
Dev Shed Newbie (0 - 499 posts)
 
Join Date: May 2005
Posts: 6 rexx User rank is Just a Lowly Private (1 - 20 Reputation Level) 
Time spent in forums: 43 m 50 sec
Reputation Power: 0
It seems to be all the PDFs. I'll check with the guy making them to see what version he's using.

Reply With Quote
  #6  
Old May 13th, 2005, 12:06 PM
rexx rexx is offline
Registered User
Dev Shed Newbie (0 - 499 posts)
 
Join Date: May 2005
Posts: 6 rexx User rank is Just a Lowly Private (1 - 20 Reputation Level) 
Time spent in forums: 43 m 50 sec
Reputation Power: 0
So some PDFs are coming out ok, but not many.

The guy making them is using Word and exporting to Acrobat 5.0.

Reply With Quote
  #7  
Old May 13th, 2005, 01:36 PM
rexx rexx is offline
Registered User
Dev Shed Newbie (0 - 499 posts)
 
Join Date: May 2005
Posts: 6 rexx User rank is Just a Lowly Private (1 - 20 Reputation Level) 
Time spent in forums: 43 m 50 sec
Reputation Power: 0
If anybody would like to take a look:

securityinc.com/round01/

A search in the documents for "battery" is a good example.

Reply With Quote
Reply

Viewing: Dev Shed ForumsProgramming Languages - MoreColdFusion Development > Verity vs. Adobe PDF


Thread Tools  Search this Thread 
Search this Thread:

Advanced Search
Display Modes  Rate This Thread 
Rate This Thread:


Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

vB code is On
Smilies are On
[IMG] code is On
HTML code is Off
View Your Warnings | New Posts | Latest News | Latest Threads | Shoutbox
Forum Jump


Forums: » Register « |  User CP |  Games |  Calendar |  Members |  FAQs |  Sitemap |  Support | 
  
 





© 2003-2008 by Developer Shed. All rights reserved. DS Cluster 1 hosted by Hostway
Stay green...Green IT