Just to conclude this thread: On 12.06.07, Russell Steicke wrote:
On Mon, Jun 11, 2007 at 04:02:03PM +0200, Joerg Lehmann wrote: ...
There is a small problem with that one, though: if you don't use the C locale, string.letters may also contain non-ASCII characters, and then "s in string.letters" fails for any Unicode string. In the present case this means it always fails. This seems to be one of the cases where Python's Unicode support is still lacking...
Yes, I hadn't thought of that. Does string.punctuation suffer from the same problem? Are there punctuation characters in Unicode outside the ASCII range? (That question may reveal that I don't fully understand the problem...)
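A quick way to look at both questions is sketched below. Two assumptions: it is written for Python 3, where every str is Unicode (the code discussed in this thread runs on Python 2), and is_unicode_punctuation is a made-up helper name, not something from the patch. As far as I can tell, string.punctuation is a fixed ASCII constant and so should not change with the locale, while Unicode does define punctuation outside the ASCII range.

```python
import string
import unicodedata


def is_unicode_punctuation(ch):
    """Return True if the single character ch has a Unicode category starting with 'P'.

    Hypothetical helper for illustration only; not part of the patch discussed here.
    """
    return unicodedata.category(ch).startswith("P")


# string.punctuation is a hard-coded ASCII constant, independent of the locale.
ascii_only = all(ord(c) < 128 for c in string.punctuation)

# Unicode defines punctuation beyond ASCII, e.g. U+00AB, U+00BF and U+2026.
non_ascii_examples = [c for c in "\u00ab\u00bf\u2026" if is_unicode_punctuation(c)]

print("string.punctuation is ASCII only:", ascii_only)
print("non-ASCII punctuation found:", [hex(ord(c)) for c in non_ascii_examples])
```

So a test against string.punctuation should not hit the locale problem, but it will not recognize non-ASCII punctuation either.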
I have now checked in a patch based on the use of string.punctuation. While this is not optimal either, it should still work most of the time (tm). Btw, did you test the SVN version a bit? I'm really interested in some feedback, because so far there has not been much... Anyway, I'd like to release the new code rather soon.

Jörg