#1
  1. No Profile Picture
    Masked Chicken
    Devshed Newbie (0 - 499 posts)

    Join Date
    Jul 2001
    Location
    Ohio/Pennsylvania
    Posts
    107
    Rep Power
    14

    Coldfusion + solr, doccount bug?


    I run cfindex and point it to a directory with 4,000 files to be indexed. After the indexing is completed, I'll run this tag:

    <cfcollection action="list" name="collectionlist" engine="solr">
    <cfdump var=#collectionlist#>

    I'll see my new index in the query, however the DOCCOUNT size is about 2,000. Is it not indexing all 4,000 files or is this a bug?
    ____________
    Thanks,
    Skeasor

    Got Debian Linux?
    www.debian.org
  2. #2
  3. No Profile Picture
    Moderator

    Join Date
    Jun 2002
    Location
    Raleigh, NC
    Posts
    5,264
    Rep Power
    968
    If you wait a bit and check it again, does that number go up?
  4. #3
  5. No Profile Picture
    Masked Chicken
    Devshed Newbie (0 - 499 posts)

    Join Date
    Jul 2001
    Location
    Ohio/Pennsylvania
    Posts
    107
    Rep Power
    14
    Originally Posted by kiteless
    If you wait a bit and check it again, does that number go up?
    Thanks for the response Kiteless.

    Well, I finally tracked down what was happening. Somehow, JVM took up too much resource and stopped the indexing half way through. After rebooting the server, I cleared the index and started indexing again. After it was completed, the numbers appeared to be correct.

    So far, my experience of the built in version of SOLR hasnt been the best. It's hard to find people that have set something up similiar to what I am doing....oh well

    Kiteless, have you looked into nutch and solr yet? I remember you mentioned before that you havent heard of it. I was just wondering if you took a look at it yet. Seems like a lot of people are using the standalone solr approach. However, in my work environment, it may not be possible.
    ____________
    Thanks,
    Skeasor

    Got Debian Linux?
    www.debian.org
  6. #4
  7. No Profile Picture
    Moderator

    Join Date
    Jun 2002
    Location
    Raleigh, NC
    Posts
    5,264
    Rep Power
    968
    I haven't. Most of what we do is indexing database content, so we haven't needed a crawler as of yet.

IMN logo majestic logo threadwatch logo seochat tools logo