<feed xmlns='http://www.w3.org/2005/Atom'>
<title>fatcat, branch v0.3.2</title>
<subtitle>[no description]</subtitle>
<id>https://git.bnewbold.net/fatcat/atom?h=v0.3.2</id>
<link rel='self' href='https://git.bnewbold.net/fatcat/atom?h=v0.3.2'/>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/fatcat/'/>
<updated>2020-04-08T18:47:33Z</updated>
<entry>
<title>ingest: configurable ES index</title>
<updated>2020-04-08T18:47:33Z</updated>
<author>
<name>Bryan Newbold</name>
<email>bnewbold@robocracy.org</email>
</author>
<published>2020-04-08T18:47:33Z</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/fatcat/commit/?id=d842983d55fd13b4a006a3caa384e68c70ce998c'/>
<id>urn:sha1:d842983d55fd13b4a006a3caa384e68c70ce998c</id>
<content type='text'>
</content>
</entry>
<entry>
<title>update bulk export instructions</title>
<updated>2020-04-07T23:10:11Z</updated>
<author>
<name>Bryan Newbold</name>
<email>bnewbold@robocracy.org</email>
</author>
<published>2020-04-07T23:08:53Z</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/fatcat/commit/?id=94544a6b32064d299a93025b48297070db2edcd8'/>
<id>urn:sha1:94544a6b32064d299a93025b48297070db2edcd8</id>
<content type='text'>
- don't do expanded and regular release dumps
- default to sqldump_public for item name (as that is common-case)
</content>
</entry>
<entry>
<title>Merge branch 'bnewbold-pubmed-get_text' into 'master'</title>
<updated>2020-04-01T22:03:19Z</updated>
<author>
<name>bnewbold</name>
<email>bnewbold@archive.org</email>
</author>
<published>2020-04-01T22:03:19Z</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/fatcat/commit/?id=32f195cec41459045f3d3453dad7a97b38d4e288'/>
<id>urn:sha1:32f195cec41459045f3d3453dad7a97b38d4e288</id>
<content type='text'>
beautifulsoup XML parsing: .string vs. .get_text()

See merge request webgroup/fatcat!40</content>
</entry>
<entry>
<title>Merge branch 'bnewbold-match-proposal' into 'master'</title>
<updated>2020-04-01T21:52:09Z</updated>
<author>
<name>Martin Czygan</name>
<email>martin@archive.org</email>
</author>
<published>2020-04-01T21:52:09Z</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/fatcat/commit/?id=0e2025091d0c974a888a5bc741495951c952ccda'/>
<id>urn:sha1:0e2025091d0c974a888a5bc741495951c952ccda</id>
<content type='text'>
proposal: fuzzy matching

See merge request webgroup/fatcat!39</content>
</entry>
<entry>
<title>proposal: fuzzy matching</title>
<updated>2020-04-01T21:52:08Z</updated>
<author>
<name>bnewbold</name>
<email>bnewbold@archive.org</email>
</author>
<published>2020-04-01T21:52:08Z</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/fatcat/commit/?id=4ffc55fe5a408a4a62cdcc680ff9c6fc93b3f5f5'/>
<id>urn:sha1:4ffc55fe5a408a4a62cdcc680ff9c6fc93b3f5f5</id>
<content type='text'>
</content>
</entry>
<entry>
<title>pubmed: use untranslated title if translated not available</title>
<updated>2020-04-01T19:02:45Z</updated>
<author>
<name>Bryan Newbold</name>
<email>bnewbold@robocracy.org</email>
</author>
<published>2020-04-01T19:02:43Z</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/fatcat/commit/?id=938d2c5366d80618b839c83baadc9b5c62d10dce'/>
<id>urn:sha1:938d2c5366d80618b839c83baadc9b5c62d10dce</id>
<content type='text'>
The primary motivation for this change is that fatcat *requires* a
non-empty title for each release entity. Pubmed/Medline occasionally
indexes just a VenacularTitle with no ArticleTitle for foreign
publications, and currently those records don't end up in fatcat at all.
</content>
</entry>
<entry>
<title>importers: replace newlines in get_text() strings</title>
<updated>2020-04-01T19:02:20Z</updated>
<author>
<name>Bryan Newbold</name>
<email>bnewbold@robocracy.org</email>
</author>
<published>2020-04-01T19:02:20Z</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/fatcat/commit/?id=f77a553350238c8ccc9c3bc0edcf47fb9dd067b3'/>
<id>urn:sha1:f77a553350238c8ccc9c3bc0edcf47fb9dd067b3</id>
<content type='text'>
</content>
</entry>
<entry>
<title>sql_dumps: stop doing redundant release dumps</title>
<updated>2020-04-01T18:29:53Z</updated>
<author>
<name>Bryan Newbold</name>
<email>bnewbold@robocracy.org</email>
</author>
<published>2020-04-01T18:29:53Z</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/fatcat/commit/?id=6eba3d7677a7169e31dc3fb8546f1455c93805cf'/>
<id>urn:sha1:6eba3d7677a7169e31dc3fb8546f1455c93805cf</id>
<content type='text'>
</content>
</entry>
<entry>
<title>Merge branch 'bnewbold-crossref-deposit' into 'master'</title>
<updated>2020-04-01T18:13:49Z</updated>
<author>
<name>bnewbold</name>
<email>bnewbold@archive.org</email>
</author>
<published>2020-04-01T18:13:49Z</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/fatcat/commit/?id=d25b813b2e6514196840a3225a8bb9a5d33a22bf'/>
<id>urn:sha1:d25b813b2e6514196840a3225a8bb9a5d33a22bf</id>
<content type='text'>
change crossref harvest date field

See merge request webgroup/fatcat!41</content>
</entry>
<entry>
<title>crossref: switch from index-date to update-date</title>
<updated>2020-03-31T04:23:11Z</updated>
<author>
<name>Bryan Newbold</name>
<email>bnewbold@robocracy.org</email>
</author>
<published>2020-03-31T03:56:04Z</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/fatcat/commit/?id=851c40143d44a73a92ff2c9556b3a63f29668c2d'/>
<id>urn:sha1:851c40143d44a73a92ff2c9556b3a63f29668c2d</id>
<content type='text'>
This goes against what the API docs recommend, but we are currently far
behind on updates and need to catch up. Other than what the docs say,
this seems to be consistent with the behavior we want.
</content>
</entry>
</feed>
