Accented text not indexed

Do you want to see something in X1? Do you dislike something about X1? Let us know!

Moderator: Mods

Accented text not indexed

Postby gt13 » Mon Apr 18, 2005 3:01 pm

There are severe problems with accented letters that probably prevent users from a lot of countries (with accented letters language) to use X1.

It seems that X1 is not able to deal with accented letters in text files!

Yes, it is hard to believe, but you can reproduce the problem by downloading
http://ct13.free.fr/X1/Indexation_test.zip (3 kB)
Decompress this archive in a place where it is indexed by X1. You will get 3 files with the same content:
a) one in MS Word format
b) one text file with Windows format
c) one text file with DOS format
These files content the words: unusualword and eczéma.

Once indexed, do the searches:
1) Search "unusualword ecz". You will get the 3 files without problem.
2) Search "unusualword eczé". You will get only the Word file.
3) Search "unusualword ecze". You will get only the Word file.

To be of interest for accented language people, X1 should return the 3 files in the 3 searches, or at least the first two files (assuming that the DOS format is obsolescent, even if I have a lot of such files on my disk…).

I tried to circumvent the problem but I did not find a solution.
There is no joker able to replace letters, and searches like "unusualword ecz*ma" or "unusualword ecz?ma" do not work (it could have been a replacement solution until a better one arrives.

Gerard

I use version 5.2 Beta Release 1 (Build 1852al) (Released Friday, April 8, 2005), Windows 2000 SP4.
The same problem occurs with Yahoo Desktop Search (version 1500zk), Windows XP SP1.
gt13
X1 Power User
X1 Power User
 
Posts: 64
Joined: Sat Apr 17, 2004 10:09 am
Location: Marseille, France

Postby noel » Mon Apr 18, 2005 4:12 pm

Hello,

Thanks for reporting this and the sample files.

X1 does not index "é" as a unique character. However, it should index "é" as an "e."

The Microsoft Word Document comes up correctly. I am not sure why the 2 .txt files do not appear. We will open a new bug for this issue.
Noel Ferreria
noel
X1 Super User
X1 Super User
 
Posts: 478
Joined: Mon Nov 17, 2003 2:05 pm
Location: Pasadena, CA

Postby gt13 » Sun Jul 10, 2005 2:52 am

Text files with accented letters not indexed...
Still no solution to this problem?

Gérard
gt13
X1 Power User
X1 Power User
 
Posts: 64
Joined: Sat Apr 17, 2004 10:09 am
Location: Marseille, France

Postby Rob » Mon Jul 11, 2005 6:24 pm

gt13 wrote:Text files with accented letters not indexed...
Still no solution to this problem?

Gérard


Hi Gérard,

It's on the roadmap and could happen this quarter or next. I'll have a better idea in a few days and will let everyone know if it's farther out than that.
Rob McClinton
Director of Customer Care, X1

My manager is Josh Jacobs
President, X1
Rob
X1 Rep
X1 Rep
 
Posts: 69
Joined: Tue Mar 23, 2004 4:42 pm
Location: Pasadena

Postby gt13 » Sat Sep 24, 2005 11:42 pm

OK
Please let us know...
Gérard
gt13
X1 Power User
X1 Power User
 
Posts: 64
Joined: Sat Apr 17, 2004 10:09 am
Location: Marseille, France


Return to Feature Requests and Gripes

Who is online

Users browsing this forum: Google [Bot] and 6 guests

cron