path: root/python/tests/files/pubmedsample_2019.xml
author Martin Czygan <martin.czygan@gmail.com> 2020-01-17 14:03:00 +0100
committer Martin Czygan <martin.czygan@gmail.com> 2020-01-17 14:03:00 +0100
commit 53756811572bab0679cb8cee1b9de95e7b29b96a (patch)
tree 8dbe3a29a79eeb5b8034a6c9d7da2952ce671a52 /python/tests/files/pubmedsample_2019.xml
parent 689da76d1c759d6368d760b4a1fa942e16095a40 (diff)
download fatcat-53756811572bab0679cb8cee1b9de95e7b29b96a.tar.gz
fatcat-53756811572bab0679cb8cee1b9de95e7b29b96a.zip
do not normalize "en dash" in DOI
Technically, "[...] DOI names may incorporate any printable characters from the Universal Character Set (UCS-2), of ISO/IEC 10646, which is the character set defined by Unicode" (https://www.doi.org/doi_handbook/2_Numbering.html#2.5.1). Mostly for QA reasons, we currently treat a DOI containing an "en dash" as invalid instead of normalizing it.
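The policy described above could be sketched roughly as follows. This is an illustration only, not the code changed by this commit: the function name `check_doi`, the loose shape regex, and the lowercasing step are all assumptions made for the example.

```python
import re
from typing import Optional

EN_DASH = "\u2013"

# Assumed, deliberately loose shape check: "10.", a numeric registrant
# prefix, a slash, and a non-empty suffix without whitespace.
DOI_SHAPE = re.compile(r"^10\.\d{3,}/\S+$")


def check_doi(raw: Optional[str]) -> Optional[str]:
    """Return a cleaned DOI, or None if it is treated as invalid.

    Hypothetical helper: a DOI containing an en dash is rejected
    outright; the dash is NOT rewritten to an ASCII hyphen.
    """
    if not raw:
        return None
    # Lowercasing is an assumption of this sketch (DOIs are case-insensitive).
    doi = raw.strip().lower()
    if EN_DASH in doi:
        # QA policy: treat as invalid rather than normalize.
        return None
    if not DOI_SHAPE.match(doi):
        return None
    return doi
```

The explicit en dash test is needed here because the shape regex alone would accept it: `\S` matches any non-whitespace character, which mirrors the handbook's point that such a DOI is technically allowed.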
Diffstat (limited to 'python/tests/files/pubmedsample_2019.xml')
0 files changed, 0 insertions, 0 deletions