Click here to get back home

Enabling Desktop Search to index XML files with non-XML file extensions

 HomeNewsGroups | Search | About
 microsoft.public.msn.search    Post an article   get this group's latest topics as an RSS feed add this group's latest topics to your My MSN content add this group's latest topics to your My Yahoo content
Subject Author Date
Enabling Desktop Search to index XML files with non-XML file extensions Jose Oliver 10-04-2005
Get Chitika Premium
Posted by Jose Oliver on October 4, 2005, 12:04 pm
Please log in for more thread options


How could I configure Windows Desktop Search to index XML files which
have non-XML extension?

Here is the situation:

I have a lot of .gpx files on my hard drives which I would like Windows
Desktop Search to index. GPX files are no other than XML data files
with the .gpx file extension which are used for the interchange of GPS
device data. (Schema and more information about the file format can be
found here- http://www.topografix.com/gpx.asp).

I have found no IFilter for this file format, but I am guessing that
the XML Ifilter included in Desktop Search could do the trick.

So,

1. How could register/configure the included XML Ifilter to index .GPX
files?

2. Is there a way to configure the XML Ifilter to a particular schema
so it could recognize the title, author XML elements within these GPX
files so that they are displayed in the Desktop Search results in the
appropiate columns?



Posted by Jan Peter Stotz on October 4, 2005, 10:02 pm
Please log in for more thread options


Jose Oliver schrieb:

> How could I configure Windows Desktop Search to index XML files which
> have non-XML extension?

The mapping between file extension and used filter can be found in the
Registry.

> I have a lot of .gpx files on my hard drives which I would like Windows
> Desktop Search to index. GPX files are no other than XML data files
> with the .gpx file extension which are used for the interchange of GPS
> device data. (Schema and more information about the file format can be
> found here- http://www.topografix.com/gpx.asp).
>
> I have found no IFilter for this file format, but I am guessing that
> the XML Ifilter included in Desktop Search could do the trick.

AFAIK it should be enough to create a key HKCR\.gpx\PersistentHandler and
set it's default value to the same value as the PersistentHandler-GUID that
can be found in the under HKCR\.xml\Persistenthandler. On my system it is
.
This should be enough.

Jan


Posted by Jose Oliver on October 4, 2005, 8:49 pm
Please log in for more thread options


Nope, that did not seem to work.

I added the .GPX extension to the Desktop Search options so it gets indexed
as text, I do see the text but it seems that the xsl stylesheet is not
applied. Any thoughts?

- jose

> Jose Oliver schrieb:
>
>> How could I configure Windows Desktop Search to index XML files which
>> have non-XML extension?
>
> The mapping between file extension and used filter can be found in the
> Registry.
>
>> I have a lot of .gpx files on my hard drives which I would like Windows
>> Desktop Search to index. GPX files are no other than XML data files
>> with the .gpx file extension which are used for the interchange of GPS
>> device data. (Schema and more information about the file format can be
>> found here- http://www.topografix.com/gpx.asp).
>>
>> I have found no IFilter for this file format, but I am guessing that
>> the XML Ifilter included in Desktop Search could do the trick.
>
> AFAIK it should be enough to create a key HKCR\.gpx\PersistentHandler and
> set it's default value to the same value as the PersistentHandler-GUID
> that
> can be found in the under HKCR\.xml\Persistenthandler. On my system it is
> .
> This should be enough.
>
> Jan




Posted by Jan Peter Stotz on October 5, 2005, 10:35 am
Please log in for more thread options


Jose Oliver schrieb:

> Nope, that did not seem to work.

Check you IFilter configuration with IFilterExplorer:
http://www.citeknet.com/Products/IFilters/IFilterExplorer/tabid/62/Default.aspx

> I added the .GPX extension to the Desktop Search options so it gets indexed
> as text, I do see the text but it seems that the xsl stylesheet is not
> applied. Any thoughts?

Desktop search indexes the content of files, not the presentation. If you
want to apply an xsl stylesheet before indexing, you have to search for an
IFilter that does this or you have to write your own IFilter.

Jan


Posted by Jose Oliver on October 5, 2005, 10:23 pm
Please log in for more thread options


Hmm interesting,

I tried adding PersistentHandler key of .xml files but it does not show up
on IFilterExplorer as registered for the .GPX extension. Is there something
I might be missing?

- jose

> Jose Oliver schrieb:
>
>> Nope, that did not seem to work.
>
> Check you IFilter configuration with IFilterExplorer:
> http://www.citeknet.com/Products/IFilters/IFilterExplorer/tabid/62/Default.aspx
>
>> I added the .GPX extension to the Desktop Search options so it gets
>> indexed
>> as text, I do see the text but it seems that the xsl stylesheet is not
>> applied. Any thoughts?
>
> Desktop search indexes the content of files, not the presentation. If you
> want to apply an xsl stylesheet before indexing, you have to search for an
> IFilter that does this or you have to write your own IFilter.
>
> Jan




Similar ThreadsPosted
Searching specific file extensions July 18, 2006, 12:30 pm
Encrypting Windows Desktop Search index file January 19, 2006, 12:12 pm
MSN Desktop don't index contens of *.nws files June 6, 2005, 7:26 pm
"Search Web" bug (Index of file:///C:/? foo) March 9, 2006, 2:47 pm
MSN Search, Index File Location & Size November 1, 2005, 4:51 am
Can I index my (closed) archive.pst file July 26, 2006, 8:31 pm
Are certain file-types permanently excluded from index? June 9, 2005, 3:06 am
Accessing index file(s) by external application June 22, 2005, 4:21 am
How can I add specific folders to index, programmatically, via file edit or registry? June 14, 2006, 7:45 am
won't index files December 12, 2005, 9:13 am

Our other projects:

Art Dolls, Fairies and Mermaids - Sunnyfaces.net

Roy's Linux, Programming and Search Engines messages

1-Script XML SitemapXML Sitemap