| Commit message (Expand) | Author | Age | Files | Lines |
... | |
* | normalizer: filter out a specific non-ASCII character in DOI | Bryan Newbold | 2020-11-04 | 1 | -1/+3 |
* | entity updates: don't ingest JSTOR DOI prefixes | Bryan Newbold | 2020-10-23 | 1 | -0/+2 |
* | entity updater: new work update feed (ident and changelog metadata only) | Bryan Newbold | 2020-10-16 | 1 | -2/+24 |
* | chocula importer: small tweaks to update behavior | Bryan Newbold | 2020-10-08 | 1 | -8/+6 |
* | elastic transform: more preservation keepers | Bryan Newbold | 2020-10-08 | 1 | -1/+2 |
* | address spammy datacite titles | Martin Czygan | 2020-09-23 | 1 | -0/+19 |
* | ingest: default to crawl protocols.io DOIs | Bryan Newbold | 2020-09-10 | 1 | -0/+2 |
* | datacite: handle case of empty-string version | Bryan Newbold | 2020-09-10 | 1 | -1/+1 |
* | remove spurious print statement | Bryan Newbold | 2020-09-03 | 1 | -1/+0 |
* | generic file entity clean-ups as part of file_meta importer | Bryan Newbold | 2020-09-02 | 2 | -0/+50 |
* | fix comment typo (thanks martin) | Bryan Newbold | 2020-08-27 | 1 | -1/+1 |
* | fixes and test coverage for file_meta importer | Bryan Newbold | 2020-08-21 | 1 | -5/+10 |
* | initial implementation of file_meta importer | Bryan Newbold | 2020-08-21 | 2 | -0/+71 |
* | entity updater: handle doi=None case better | Bryan Newbold | 2020-08-14 | 1 | -1/+1 |
* | entity updater: es['publisher_type'] not always set | Bryan Newbold | 2020-08-14 | 1 | -1/+1 |
* | Merge branch 'bnewbold-ingest-improvements' into 'master' | Martin Czygan | 2020-08-13 | 2 | -33/+114 |
|\ |
|
| * | entity update: change big5 ingest behavior | Bryan Newbold | 2020-08-11 | 1 | -9/+15 |
| * | entity update: default to ingest non-OA works | Bryan Newbold | 2020-08-11 | 1 | -9/+10 |
| * | entity update: skip ingest of figshare+zenodo 'group' DOIs | Bryan Newbold | 2020-08-11 | 1 | -0/+15 |
| * | datacite import: figshare-specific hacks | Bryan Newbold | 2020-08-11 | 1 | -3/+3 |
| * | datacite import: refactor release_type detection into static method | Bryan Newbold | 2020-08-11 | 1 | -14/+51 |
| * | datacite import: refactor publisher-specific hacks into static method | Bryan Newbold | 2020-08-11 | 1 | -15/+29 |
| * | update crawl blocklist for SPNv2 requests which mostly fail | Bryan Newbold | 2020-08-10 | 1 | -2/+10 |
* | | harvest: datacite API yields HTTP 200 with broken JSON | Martin Czygan | 2020-08-10 | 1 | -1/+8 |
|/ |
|
* | release ES transform tweaks | Bryan Newbold | 2020-08-07 | 1 | -3/+5 |
* | chocula import update tweaks | Bryan Newbold | 2020-08-04 | 1 | -10/+14 |
* | more update keys and cases for chocula importer | Bryan Newbold | 2020-08-04 | 1 | -5/+11 |
* | fix key name mismatch in chocula importer | Bryan Newbold | 2020-08-04 | 1 | -1/+1 |
* | basic toml transform helper | Bryan Newbold | 2020-07-30 | 2 | -4/+20 |
* | Merge branch 'bnewbold-more-lint-fixes' into 'master' | Martin Czygan | 2020-07-24 | 6 | -25/+18 |
|\ |
|
| * | fix issnl typo in pubmed | Bryan Newbold | 2020-07-23 | 1 | -1/+1 |
| * | remove isascii() work around definition in importers/datacite.py | Bryan Newbold | 2020-07-23 | 1 | -7/+1 |
| * | simple lint (flake8) fixes over python codebase | Bryan Newbold | 2020-07-23 | 5 | -17/+16 |
* | | simplify in_kbart check statement | Bryan Newbold | 2020-07-23 | 1 | -1/+1 |
* | | make in_kbart transform inclusive of last year | Bryan Newbold | 2020-07-23 | 1 | -0/+9 |
|/ |
|
* | Merge branch 'martin-datacite-duplicated-author-gh-59' into 'master' | bnewbold | 2020-07-11 | 1 | -6/+60 |
|\ |
|
| * | datacite: resolve formatting issues in tests | Martin Czygan | 2020-07-10 | 33 | -133/+51 |
| |\ |
|
| * | | datacite: there should be no index gaps | Martin Czygan | 2020-07-10 | 1 | -2/+8 |
| * | | datacite: document contributor types | Martin Czygan | 2020-07-10 | 1 | -0/+25 |
| * | | wip: contrib, GH59 | Martin Czygan | 2020-07-10 | 1 | -16/+22 |
| * | | datacite: address duplicated contributor issue | Martin Czygan | 2020-07-07 | 1 | -0/+16 |
* | | | Merge branch 'martin-datacite-bugfix-sentry-44035' into 'master' | bnewbold | 2020-07-11 | 1 | -0/+4 |
|\ \ \
| |_|/
|/| | |
|
| * | | datacite: mitigate sentry #44035 | Martin Czygan | 2020-07-10 | 1 | -0/+4 |
* | | | Merge branch 'martin-arxiv-fix-http-503' into 'master' | bnewbold | 2020-07-10 | 1 | -1/+1 |
|\ \ \
| |/ /
|/| | |
|
| * | | arxiv: do retry five times of HTTP 503 | Martin Czygan | 2020-07-10 | 1 | -1/+1 |
* | | | datacite: fix attribute error | Martin Czygan | 2020-07-07 | 1 | -1/+1 |
|/ / |
|
* | | lint (flake8) tool python files | Bryan Newbold | 2020-07-01 | 33 | -130/+46 |
* | | reviewer: fix bugs in common code found by mypy | Bryan Newbold | 2020-07-01 | 1 | -2/+3 |
|/ |
|
* | add new license mappings | Bryan Newbold | 2020-06-30 | 2 | -0/+27 |
* | datacite: improve license mapping | Martin Czygan | 2020-06-30 | 1 | -9/+15 |