Help - Search - Members - Calendar
Full Version: character encoding help
HTMLHelp Forums > Web Authoring > General Web Design
html22
I need help with my website. Im using htdig for my website's search engine. And my problem is everytime i search for like spanish term for some reason this char set "Ã" is not recognize. Its giving me some funny characters.

Here an example of the website.

http://toxtown.nlm.nih.gov/cgi-bin/htsearch, try to search for "casa"

See the 1st title "Tox Town en español - Radón", Radón should spell like Radón.

Any help will be appreciated. Thanks!
Darin McGrew
According to the ht://Dig FAQ, it doesn't support UTF-8 documents. If you want to use ht://Dig to index your site, then you need to convert to an encoding that it supports.
Brian Chandler
QUOTE
According to the ht://Dig FAQ, it doesn't support UTF-8 documents. If you want to use ht://Dig to index your site, then you need to convert to an encoding that it supports.


Slightly backwards answer, it seems to me. If they don't support UTF-8, the thing you need to do is find someone who does.
html22
QUOTE(Darin McGrew @ Sep 19 2007, 05:35 PM) *

According to the ht://Dig FAQ, it doesn't support UTF-8 documents. If you want to use ht://Dig to index your site, then you need to convert to an encoding that it supports.



Hi thanks much. Can you suggest any encoding that supports ht://DIG??? thanks again!
Brian Chandler
QUOTE
Hi thanks much. Can you suggest any encoding that supports ht://DIG??? thanks again!


Any 8-bit encoding, such as ISO-8859-1 for Spanish. But reverting to that from UTF-8 is a huge step backwards.
html22
QUOTE(Brian Chandler @ Sep 20 2007, 09:02 AM) *

QUOTE
Hi thanks much. Can you suggest any encoding that supports ht://DIG??? thanks again!


Any 8-bit encoding, such as ISO-8859-1 for Spanish. But reverting to that from UTF-8 is a huge step backwards.



thanks again. Is that the best way to do it tho?
Darin McGrew
Converting from UTF-8 to ISO-8859-1 to accomodate ht://Dig would be a step backwards. Upgrading to a search tool that understands UTF-8 would be a step forwards.
html22
QUOTE(Darin McGrew @ Sep 20 2007, 08:18 PM) *

Converting from UTF-8 to ISO-8859-1 to accomodate ht://Dig would be a step backwards. Upgrading to a search tool that understands UTF-8 would be a step forwards.



Hi we don't want to change the htdig site search engone tool. It was working fine with old version of our site( it was searching both english and spanish words). Do you have any idea what might be the cause of the error? thanks again!
Darin McGrew
UTF-8 uses multiple bytes to encode some characters. ht://Dig does not support multi-byte characters. Presumably, you converted your content to UTF-8 (from ISO-8859-1?), and ht://Dig could no longer index it properly.
This is a "lo-fi" version of our main content. To view the full version with more information, formatting and images, please click here.
Invision Power Board © 2001-2010 Invision Power Services, Inc.