<feed xmlns='http://www.w3.org/2005/Atom'>
<title>fatcat/notes, branch bnewbold-pythonclient-types</title>
<subtitle>[no description]</subtitle>
<id>https://git.bnewbold.net/fatcat/atom?h=bnewbold-pythonclient-types</id>
<link rel='self' href='https://git.bnewbold.net/fatcat/atom?h=bnewbold-pythonclient-types'/>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/fatcat/'/>
<updated>2021-11-23T00:31:44+00:00</updated>
<entry>
<title>commit v0.5.0 deployment notes</title>
<updated>2021-11-23T00:31:44+00:00</updated>
<author>
<name>Bryan Newbold</name>
<email>bnewbold@robocracy.org</email>
</author>
<published>2021-11-23T00:31:44+00:00</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/fatcat/commit/?id=b1fbd1e5e0c95e79ca0ea999ce7a8a15df46c2bd'/>
<id>urn:sha1:b1fbd1e5e0c95e79ca0ea999ce7a8a15df46c2bd</id>
<content type='text'>
</content>
</entry>
<entry>
<title>updated notes on possible cleanups</title>
<updated>2021-11-18T00:13:07+00:00</updated>
<author>
<name>Bryan Newbold</name>
<email>bnewbold@robocracy.org</email>
</author>
<published>2021-11-18T00:13:07+00:00</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/fatcat/commit/?id=56a7306f1cebf5833238bc4d894261a050c8e3c9'/>
<id>urn:sha1:56a7306f1cebf5833238bc4d894261a050c8e3c9</id>
<content type='text'>
</content>
</entry>
<entry>
<title>document cleanups run this week</title>
<updated>2021-11-12T19:45:48+00:00</updated>
<author>
<name>Bryan Newbold</name>
<email>bnewbold@robocracy.org</email>
</author>
<published>2021-11-12T19:45:48+00:00</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/fatcat/commit/?id=f157cc7a50e0fd9a1c79efb3c29be7d8508ffa66'/>
<id>urn:sha1:f157cc7a50e0fd9a1c79efb3c29be7d8508ffa66</id>
<content type='text'>
</content>
</entry>
<entry>
<title>Merge branch 'bnewbold-import-refactors' into 'master'</title>
<updated>2021-11-11T01:12:18+00:00</updated>
<author>
<name>bnewbold</name>
<email>bnewbold@archive.org</email>
</author>
<published>2021-11-11T01:12:18+00:00</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/fatcat/commit/?id=6ad9d24e4d7d901d6fc394e6e91575f6acba7ff4'/>
<id>urn:sha1:6ad9d24e4d7d901d6fc394e6e91575f6acba7ff4</id>
<content type='text'>
import refactors and deprecations

Some of these are from old stale branches (the Datacite subject metadata patch), but most are from yesterday and today. Sort of a hodge-podge, but the general theme is getting around to deferred cleanups and refactors specific to importer code before making some behavioral changes.

The Datacite-specific stuff could use review here.

Remove unused/deprecated/dead code:

- cdl_dash_dat and wayback_static importers, which were for specific early example entities and have been superseded by other importers
- "extid map" sqlite3 feature from several importers, which was only used for initial bulk imports (and maybe should not have been used)

Refactors:

- moved a number of large data structures out of importer code and into a dedicated static file (`biblio_lookup_tables.py`). Didn't move all of them, just the ones that were either generic or very large (making the code hard to read)
- shuffled around relative imports and some function names ("clean_str" vs. "clean")

Some actual behavioral changes:

- remove some Datacite-specific license slugs
- stop trying to fix double-slashes in DOIs; that was doing more harm than good (some DOIs do actually have double-slashes!)
- remove some excess metadata from Datacite 'extra' fields</content>
</entry>
<entry>
<title>wayback ts cleanup: one more filter tweak</title>
<updated>2021-11-10T06:55:58+00:00</updated>
<author>
<name>Bryan Newbold</name>
<email>bnewbold@robocracy.org</email>
</author>
<published>2021-11-10T06:55:58+00:00</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/fatcat/commit/?id=cd09c6d6bd4deef0627de4f8a8a301725db01e14'/>
<id>urn:sha1:cd09c6d6bd4deef0627de4f8a8a301725db01e14</id>
<content type='text'>
</content>
</entry>
<entry>
<title>update cleanups notes</title>
<updated>2021-11-10T05:12:54+00:00</updated>
<author>
<name>Bryan Newbold</name>
<email>bnewbold@robocracy.org</email>
</author>
<published>2021-11-10T05:12:54+00:00</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/fatcat/commit/?id=ca3aedcefd5c3835b6fa9b685200ef12d635d22c'/>
<id>urn:sha1:ca3aedcefd5c3835b6fa9b685200ef12d635d22c</id>
<content type='text'>
</content>
</entry>
<entry>
<title>initial file/release bugfix cleanup worker and notes</title>
<updated>2021-11-10T02:50:34+00:00</updated>
<author>
<name>Bryan Newbold</name>
<email>bnewbold@robocracy.org</email>
</author>
<published>2021-11-10T02:48:37+00:00</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/fatcat/commit/?id=fd71860047f84305a50772aa2e7494fd61cfff30'/>
<id>urn:sha1:fd71860047f84305a50772aa2e7494fd61cfff30</id>
<content type='text'>
</content>
</entry>
<entry>
<title>add notes about 'double slash in DOI' issue</title>
<updated>2021-11-10T02:49:46+00:00</updated>
<author>
<name>Bryan Newbold</name>
<email>bnewbold@robocracy.org</email>
</author>
<published>2021-11-10T02:14:58+00:00</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/fatcat/commit/?id=23fd36a3e8505c1ed6d13367a3fb62a8bf2242d7'/>
<id>urn:sha1:23fd36a3e8505c1ed6d13367a3fb62a8bf2242d7</id>
<content type='text'>
</content>
</entry>
<entry>
<title>updates to lowercase DOI cleanup</title>
<updated>2021-11-10T00:56:28+00:00</updated>
<author>
<name>Bryan Newbold</name>
<email>bnewbold@robocracy.org</email>
</author>
<published>2021-11-10T00:56:28+00:00</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/fatcat/commit/?id=d9284d421618742f5ecd76ba2c6d92dcefa5e5db'/>
<id>urn:sha1:d9284d421618742f5ecd76ba2c6d92dcefa5e5db</id>
<content type='text'>
</content>
</entry>
<entry>
<title>more iteration on short wayback timestamp cleanup</title>
<updated>2021-11-09T23:46:20+00:00</updated>
<author>
<name>Bryan Newbold</name>
<email>bnewbold@robocracy.org</email>
</author>
<published>2021-11-09T23:46:20+00:00</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/fatcat/commit/?id=996b2e2084c1798126bd91dd950c063982398bec'/>
<id>urn:sha1:996b2e2084c1798126bd91dd950c063982398bec</id>
<content type='text'>
</content>
</entry>
</feed>
