LMPX.COM |
Home | Linux | Mysql | PHP | XML | ||
|
|
|||
From: Robert Barta Date: Mon Jun 4 10:08:45 2007 Subject: AI::Categorizer and Umlauts?
Hi,
I seem to have problems with umlauts, such as in words
Präsentation
When a document is added with
return new AI::Categorizer::Document(name => $filename,
content => $content);
to the collection, after loading and finish, the feature vector
contains only fragments of these words, such as
pr => 1
sentation => 1
Setting the locale on the shell or in Perl does not have any effect
use locale;
not even with turning on de_AT explicitly.
--
Aaaaaah, lib/AI/Categorizer/Document.pm is NOT using locale and use locale
is very, uhm, local %-)
Patching the file does not seem to break the test cases.
\rho
| Navigate in group perl.ai at sever nntp.perl.org | |
| Previous | Next |
| © No Copyright You are free to use Anything |
Site Maintained by PHP Developer
Powered By PHP Consultants |