Commit message (Collapse) | Author | Age | Files | Lines | |
---|---|---|---|---|---|
* | vastly improve entity_to_dict() speed | Bryan Newbold | 2019-01-28 | 1 | -1/+9 |
| | |||||
* | fatcat -> fatcat_release ES index | Bryan Newbold | 2019-01-28 | 2 | -10/+13 |
| | |||||
* | transform and import fixes/tweaks | Bryan Newbold | 2019-01-25 | 4 | -18/+51 |
| | |||||
* | update journal meta import/transform | Bryan Newbold | 2019-01-25 | 2 | -132/+194 |
| | |||||
* | grobid import extra metadata tweaks | Bryan Newbold | 2019-01-24 | 1 | -6/+7 |
| | |||||
* | refactor _get_editgroup => get_editgroup_id | Bryan Newbold | 2019-01-24 | 2 | -5/+6 |
| | |||||
* | refactor make_rel_url | Bryan Newbold | 2019-01-24 | 3 | -29/+66 |
| | |||||
* | tweak crossref import, and update tests | Bryan Newbold | 2019-01-24 | 1 | -11/+27 |
| | |||||
* | allow importing contrib/refs lists | Bryan Newbold | 2019-01-24 | 1 | -5/+13 |
| | | | | | | The motivation here isn't really to support these gigantic lists on principle, but to be able to ingest large corpuses without having to decide whether to filter out or crop such lists. | ||||
* | notes on refactoring container 'extra' | Bryan Newbold | 2019-01-24 | 1 | -0/+79 |
| | |||||
* | importer bugfixes | Bryan Newbold | 2019-01-23 | 3 | -8/+14 |
| | |||||
* | start changes to release ES schema | Bryan Newbold | 2019-01-23 | 1 | -30/+90 |
| | |||||
* | bunch of crossref import tweaks (need tests) | Bryan Newbold | 2019-01-23 | 1 | -50/+43 |
| | |||||
* | try to fix any_abstract | Bryan Newbold | 2019-01-23 | 1 | -1/+1 |
| | |||||
* | clean() checks if it returns null-length string | Bryan Newbold | 2019-01-23 | 1 | -1/+5 |
| | |||||
* | matched importer: bezerk mode to skip file updates | Bryan Newbold | 2019-01-23 | 1 | -11/+5 |
| | |||||
* | ftfy all over (needs Pipfile.lock) | Bryan Newbold | 2019-01-23 | 7 | -39/+74 |
| | |||||
* | more tests; fix some importer behavior | Bryan Newbold | 2019-01-23 | 2 | -37/+43 |
| | |||||
* | improve changelog tests | Bryan Newbold | 2019-01-23 | 1 | -1/+0 |
| | |||||
* | refactor remaining importers | Bryan Newbold | 2019-01-22 | 7 | -328/+297 |
| | |||||
* | refactored crossref importer to new style | Bryan Newbold | 2019-01-22 | 3 | -89/+166 |
| | |||||
* | new importer API interfaces | Bryan Newbold | 2019-01-22 | 2 | -0/+181 |
| | |||||
* | crossref importer updates | Bryan Newbold | 2019-01-22 | 1 | -19/+78 |
| | |||||
* | remove coden and abbrev from python tools | Bryan Newbold | 2019-01-21 | 1 | -2/+0 |
| | |||||
* | include filesets and webcaptures in exports | Bryan Newbold | 2019-01-18 | 1 | -1/+1 |
| | |||||
* | fix typo in elastic transform code | Bryan Newbold | 2019-01-18 | 1 | -1/+1 |
| | |||||
* | more 'true' -> True query param fixes | Bryan Newbold | 2019-01-18 | 4 | -4/+4 |
| | |||||
* | state in elasticsearch (and deleted/redirects) | Bryan Newbold | 2019-01-18 | 1 | -2/+8 |
| | |||||
* | issn => journal_metadata in several places | Bryan Newbold | 2019-01-17 | 2 | -6/+6 |
| | |||||
* | use full-on autoaccept mode | Bryan Newbold | 2019-01-11 | 5 | -12/+16 |
| | | | | | | | | Now that editor_id is infered from token, don't *need* to create ahead of time. This backend change simplifies things greatly (either update an existing editgroup, or create new and *only* include entities in the batch transaction), at the cost of being able to configure the editgroup in any way, including setting a description. | ||||
* | Merge branch 'bnewbold-crude-auth' | Bryan Newbold | 2019-01-08 | 11 | -36/+131 |
|\ | | | | | | | | | Fixed a conflict in: python/fatcat_export.py | ||||
| * | workers do API-passing (not URI-passing) | Bryan Newbold | 2019-01-08 | 2 | -9/+7 |
| | | |||||
| * | importers and tests all use new api-passing | Bryan Newbold | 2019-01-08 | 6 | -10/+44 |
| | | |||||
| * | start updating importer auth with crossref importer | Bryan Newbold | 2019-01-08 | 3 | -14/+40 |
| | | |||||
| * | entity_to_json -> entity_to_dict | Bryan Newbold | 2019-01-08 | 2 | -2/+2 |
| | | |||||
| * | start refactoring API object passing | Bryan Newbold | 2019-01-08 | 2 | -0/+41 |
| | | |||||
| * | don't need to supply editor_id now | Bryan Newbold | 2018-12-31 | 2 | -8/+4 |
| | | |||||
* | | check request status codes idiomatically | Bryan Newbold | 2018-12-29 | 2 | -3/+3 |
|/ | |||||
* | python impl of API ident harmonization | Bryan Newbold | 2018-12-24 | 6 | -36/+36 |
| | |||||
* | implement release_year (and rustfmt) | Bryan Newbold | 2018-12-24 | 3 | -9/+18 |
| | |||||
* | do actually require title for crossref import | Bryan Newbold | 2018-12-01 | 1 | -3/+3 |
| | |||||
* | fix file extraction (and transforms) | Bryan Newbold | 2018-11-26 | 1 | -6/+6 |
| | |||||
* | clean up harvester comments/docs | Bryan Newbold | 2018-11-21 | 3 | -50/+31 |
| | |||||
* | crossref importer doesn't require author/title attributes | Bryan Newbold | 2018-11-21 | 1 | -6/+6 |
| | |||||
* | crossref importer checks for existing DOIs | Bryan Newbold | 2018-11-21 | 2 | -4/+19 |
| | |||||
* | use isoformat() to format dates | Bryan Newbold | 2018-11-21 | 3 | -5/+6 |
| | | | | This shouldn't change behavior; it's just more consistent. | ||||
* | grobid importer: release_date as a date | Bryan Newbold | 2018-11-21 | 1 | -1/+1 |
| | |||||
* | fix loop_sleep typo | Bryan Newbold | 2018-11-21 | 2 | -2/+2 |
| | |||||
* | fix datacite DOI extraction | Bryan Newbold | 2018-11-21 | 1 | -1/+1 |
| | |||||
* | fix OAI-PMH name/finished message | Bryan Newbold | 2018-11-21 | 1 | -1/+6 |
| |