Please start any new threads on our new site at https://forums.sqlteam.com. We've got lots of great SQL Server experts to answer whatever question you can come up with.

 All Forums
 SQL Server 2000 Forums
 Transact-SQL (2000)
 Full Text Index Query

Author  Topic 

otto
Starting Member

21 Posts

Posted - 2002-12-02 : 13:52:09
I'm using SQL 2000 to store files in a Image column. Additionally, I'm using Full Text Indexing on that column, which requires another column that stores the file extension. SQL can index Microsoft documents (XLS, DOC, PPT) and HTM, HTML, and TXT documents.

My question has two parts:

First Q: When I search for a word or phrase, I want to know more then if it is found, but rather, how many times it was found in a given document (something like a search rating).

Is this possible with using CONTAINS?

Second Q: Is it possible to index PDF files? If so, how?

Thanks,

Otto

robvolk
Most Valuable Yak

15732 Posts

Posted - 2002-12-02 : 14:00:11
1. I think CONTAINSTABLE allows you to use a rank, or specify the top number of results arranged by their rank. Books Online describes it.

2. If the proper filter exists to process PDF files, then yes, it should work with it, but I don't know if such a filter exists, and I'm pretty sure M$ won't be developing one soon (XDocs is supposed to be an Adobe killer)

Go to Top of Page

jasper_smith
SQL Server MVP & SQLTeam MVY

846 Posts

Posted - 2002-12-02 : 16:04:15
1) As Rob says - use CONTAINSTABLE
2) There is a filter available from the adobe web site.Search for iFilter
There is an issue with it however in that it is not thread safe and MSSearch is mult-threaded - this can lead to intermitent catalog population failures. Have a look at the following KB Article
[url]http://support.microsoft.com/default.aspx?scid=KB;EN-US;Q323040[/url]


HTH
Jasper Smith
Go to Top of Page

otto
Starting Member

21 Posts

Posted - 2002-12-04 : 17:59:14
Thanks for the prompt feedback.

I've looked at CONTAINSTABLE and it works, but is limited to only one field at a time? Also, can anyone explain how they rank the results? I've tested the query and I get results back and a rank, but I need more information on how the ranking is derived.

Secondly, I've used IFilter.exe from Adobe in the past with MS Index Server. I'm looking to full text index documents that are stored in a SQL column (image column) that are PDF files. It doesn't look like IFilter.exe does that (v5 is what I just downloaded). Any insight into how I might "point" SQL at IFilter?

Thanks again for the help,

Otto

Go to Top of Page
   

- Advertisement -