<feed xmlns='http://www.w3.org/2005/Atom'>
<title>fatcat/python/tests/files/datacite, branch v0.3.3</title>
<subtitle>[no description]</subtitle>
<id>https://git.bnewbold.net/fatcat/atom?h=v0.3.3</id>
<link rel='self' href='https://git.bnewbold.net/fatcat/atom?h=v0.3.3'/>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/fatcat/'/>
<updated>2020-08-11T22:32:28+00:00</updated>
<entry>
<title>datacite importer: update test cases for 'Additional file' as component, not stub</title>
<updated>2020-08-11T22:32:28+00:00</updated>
<author>
<name>Bryan Newbold</name>
<email>bnewbold@robocracy.org</email>
</author>
<published>2020-08-11T22:24:41+00:00</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/fatcat/commit/?id=a95b382a7add348c15bca4ed98729e47b17df11a'/>
<id>urn:sha1:a95b382a7add348c15bca4ed98729e47b17df11a</id>
<content type='text'>
</content>
</entry>
<entry>
<title>datacite import: figshare-specific hacks</title>
<updated>2020-08-11T22:32:28+00:00</updated>
<author>
<name>Bryan Newbold</name>
<email>bnewbold@robocracy.org</email>
</author>
<published>2020-08-11T00:35:16+00:00</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/fatcat/commit/?id=ff05a03a3874e17557174d3534a1c2d11e01c4a6'/>
<id>urn:sha1:ff05a03a3874e17557174d3534a1c2d11e01c4a6</id>
<content type='text'>
</content>
</entry>
<entry>
<title>datacite: adjust tests</title>
<updated>2020-07-10T16:29:47+00:00</updated>
<author>
<name>Martin Czygan</name>
<email>martin.czygan@gmail.com</email>
</author>
<published>2020-07-10T16:29:47+00:00</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/fatcat/commit/?id=fdf1028c19b0623e30b91e49ffa65ed130dcfdc1'/>
<id>urn:sha1:fdf1028c19b0623e30b91e49ffa65ed130dcfdc1</id>
<content type='text'>
</content>
</entry>
<entry>
<title>wip: contrib, GH59</title>
<updated>2020-07-09T22:50:34+00:00</updated>
<author>
<name>Martin Czygan</name>
<email>martin.czygan@gmail.com</email>
</author>
<published>2020-07-09T22:50:34+00:00</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/fatcat/commit/?id=40f77b78aa331ca67b510dfece77e6a6000f8c2f'/>
<id>urn:sha1:40f77b78aa331ca67b510dfece77e6a6000f8c2f</id>
<content type='text'>
</content>
</entry>
<entry>
<title>datacite: address duplicated contributor issue</title>
<updated>2020-07-07T00:08:26+00:00</updated>
<author>
<name>Martin Czygan</name>
<email>martin.czygan@gmail.com</email>
</author>
<published>2020-07-07T00:08:26+00:00</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/fatcat/commit/?id=fcc6f24a95a7b77bda4ec813daecc2b737a82412'/>
<id>urn:sha1:fcc6f24a95a7b77bda4ec813daecc2b737a82412</id>
<content type='text'>
Use string comparison.

* https://fatcat.wiki/release/spjysmrnsrgyzgq6ise5o44rlu/contribs
* https://api.datacite.org/dois/10.25940/roper-31098406
</content>
</entry>
<entry>
<title>datacite: fix type error</title>
<updated>2020-04-22T20:25:36+00:00</updated>
<author>
<name>Martin Czygan</name>
<email>martin.czygan@gmail.com</email>
</author>
<published>2020-04-22T20:25:36+00:00</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/fatcat/commit/?id=e0baeade7924019c5bbd27d9a7c116a1e26854fc'/>
<id>urn:sha1:e0baeade7924019c5bbd27d9a7c116a1e26854fc</id>
<content type='text'>
Up to now, we expected the description to be a string or list. Add
handling for int as well.

First appeared: Apr 22 19:58:39.
</content>
</entry>
<entry>
<title>datacite: fix a raw name constraint violation</title>
<updated>2020-04-20T18:52:10+00:00</updated>
<author>
<name>Martin Czygan</name>
<email>martin.czygan@gmail.com</email>
</author>
<published>2020-04-20T18:52:10+00:00</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/fatcat/commit/?id=7c6febf20c84dd4f5778e1fb02369456f7dad344'/>
<id>urn:sha1:7c6febf20c84dd4f5778e1fb02369456f7dad344</id>
<content type='text'>
It was possible that contribs got added which had no raw name. One
example would be a name consisting of whitespace only.

This fix adds a final check for this case.
</content>
</entry>
<entry>
<title>datacite: add exception for https://www.micropublication.org/</title>
<updated>2020-01-31T00:44:46+00:00</updated>
<author>
<name>Martin Czygan</name>
<email>martin.czygan@gmail.com</email>
</author>
<published>2020-01-31T00:44:46+00:00</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/fatcat/commit/?id=a42206d2603e28f1311ac3873dc168c78eabffee'/>
<id>urn:sha1:a42206d2603e28f1311ac3873dc168c78eabffee</id>
<content type='text'>
</content>
</entry>
<entry>
<title>datacite: improve date handling and minor tweak</title>
<updated>2020-01-30T12:36:01+00:00</updated>
<author>
<name>Martin Czygan</name>
<email>martin.czygan@gmail.com</email>
</author>
<published>2020-01-30T12:36:01+00:00</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/fatcat/commit/?id=7dec2d1560ebf5ca6d0d337eb246fe345f6ec0bb'/>
<id>urn:sha1:7dec2d1560ebf5ca6d0d337eb246fe345f6ec0bb</id>
<content type='text'>
Records from https://www.micropublication.org/ did not have a date in
FC, although raw data contained date strings - they were not using the
finer-grained "attributes.date" but "attributes.published" and/or
"attributes.publicationYear".

Support for those fields has been added, including a test case.

During this test (#30) a processing gap for names became clear (author
may have "given_name" and "surname", but no "name"). This bug has been
fixed, too.
</content>
</entry>
<entry>
<title>do not normalize "en dash" in DOI</title>
<updated>2020-01-17T13:03:00+00:00</updated>
<author>
<name>Martin Czygan</name>
<email>martin.czygan@gmail.com</email>
</author>
<published>2020-01-17T13:03:00+00:00</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/fatcat/commit/?id=53756811572bab0679cb8cee1b9de95e7b29b96a'/>
<id>urn:sha1:53756811572bab0679cb8cee1b9de95e7b29b96a</id>
<content type='text'>
Technically, [...] DOI names may incorporate any printable characters
from the Universal Character Set (UCS-2), of ISO/IEC 10646, which is the
character set defined by Unicode (https://www.doi.org/doi_handbook/2_Numbering.html#2.5.1).

For mostly QA reasons, we currently treat a DOI with an "en dash" as
invalid.
</content>
</entry>
</feed>
