#1
  1. No Profile Picture
    Contributing User
    Devshed Newbie (0 - 499 posts)

    Join Date
    Aug 2002
    Posts
    238
    Rep Power
    12

    Problem getting .pdf Titles with CreateRecordSet()


    Hi all,

    This is what I have:
    ASP (VBScript) Code:
    set Q = Server.CreateObject("MSSearch.Query")
    set util = Server.CreateObject("MSSearch.Util")

    Q.Catalog = "Something"
    Q.AllowEnumeration = TRUE
    Q.Query = CompSearch
    Q.SortBy = "filename[a]"
    Q.Columns = "DocTitle, URL, filename, size"
    Q.MaxRecords = 1000

    wScopeQuery = "file:" & ActualRoot2 & VirtPath
    util.AddScopeToQuery Q, wScopeQuery, "deep"

    set RS = Q.CreateRecordSet("sequential")

    if Not RS.EOF and Not RS.BOF then
        wFilenmtmp = LCase(RS("filename"))
        StrLen = Len(wFilenmtmp)
        CharsLeft = StrLen - 4
        wFilenm = Left(wFilenmtmp, CharsLeft)
        wExt = Right(wFilenmtmp, 3)
        if wExt = "doc" or wExt = "pdf" then
            wVpathDoc = Mid(RS("URL"), Len(ActualRoot) + 5)
            wDoctitle = RS("DocTitle")
            wSizeDoc = FormatNumber(RS("size"), 0)
        else
            if wExt = "zip" then
                wVpathZip = Mid(RS("URL"), Len(ActualRoot) + 5)
                wSizeZip = FormatNumber(RS("size"), 0)
            end if
        end if
    ....other code

    This works great when the files are .doc or .zip, but it fails to get the title
    of a .pdf. The filename, URL and size are OK, but the Title is missing whenever a .pdf
    file is found.
    I have verified many times that the .pdf files do have titles, both by checking their properties
    and by mousing over them, which shows the type, title and size.
    The files are managed by VSS.

    Any ideas why the .pdf Titles are not being grabbed?

    Thanks.
  2. #2
    More info/questions:

    With the current rules set in the schema.txt file
    on Site Server, when the search encounters a .pdf file
    it gets everything (URL and file name) except the title.
    How can I get the .pdf Title extracted as well?
    Do I need to set up another rule with a different propid and
    type for .pdf Titles? If so, how?

    Thanks!
