Better handling of unicode characters in sample names #292

isaacovercast · 2018-05-03T16:32:42Z

Writing pandas dataframes to file buffers fails with unicode in sample names. Each assembly step will complete, but then writing stats files will fail with:

"Encountered an unexpected error (see ./ipyrad_log.txt)
Error message is below -------------------------------
writelines() argument must be a sequence of strings"

To reproduce:
Edit any of the simulated barcodes files and swap one of the letters for ö. Run steps.

Useful/related:
pandas-dev/pandas#680
https://stackoverflow.com/questions/38786936/pandas-convert-unicode-strings-to-string

The text was updated successfully, but these errors were encountered:

isaacovercast · 2018-05-03T16:55:39Z

Use io.open and you can set the encoding to utf-8. This was useful: https://stackoverflow.com/questions/6048085/writing-unicode-text-to-a-text-file

isaacovercast · 2018-05-03T17:47:16Z

Fixed in v.0.7.24

isaacovercast closed this as completed May 3, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Better handling of unicode characters in sample names #292

Better handling of unicode characters in sample names #292

isaacovercast commented May 3, 2018

isaacovercast commented May 3, 2018

Uh oh!

isaacovercast commented May 3, 2018

Uh oh!

Better handling of unicode characters in sample names #292

Better handling of unicode characters in sample names #292

Comments

isaacovercast commented May 3, 2018

isaacovercast commented May 3, 2018

Uh oh!

isaacovercast commented May 3, 2018

Uh oh!