| Commit message (Expand) | Author | Age | Files | Lines |
* | datacite import: figshare-specific hacks | Bryan Newbold | 2020-08-11 | 1 | -3/+3 |
* | datacite import: refactor release_type detection into static method | Bryan Newbold | 2020-08-11 | 1 | -14/+51 |
* | datacite import: refactor publisher-specific hacks into static method | Bryan Newbold | 2020-08-11 | 1 | -15/+29 |
* | chocula import update tweaks | Bryan Newbold | 2020-08-04 | 1 | -10/+14 |
* | more update keys and cases for chocula importer | Bryan Newbold | 2020-08-04 | 1 | -5/+11 |
* | fix key name mismatch in chocula importer | Bryan Newbold | 2020-08-04 | 1 | -1/+1 |
* | fix issnl typo in pubmed | Bryan Newbold | 2020-07-23 | 1 | -1/+1 |
* | remove isascii() work around definition in importers/datacite.py | Bryan Newbold | 2020-07-23 | 1 | -7/+1 |
* | simple lint (flake8) fixes over python codebase | Bryan Newbold | 2020-07-23 | 5 | -17/+16 |
* | Merge branch 'martin-datacite-duplicated-author-gh-59' into 'master' | bnewbold | 2020-07-11 | 1 | -6/+60 |
|\ |
|
| * | datacite: resolve formatting issues in tests | Martin Czygan | 2020-07-10 | 16 | -72/+28 |
| |\ |
|
| * | | datacite: there should be no index gaps | Martin Czygan | 2020-07-10 | 1 | -2/+8 |
| * | | datacite: document contributor types | Martin Czygan | 2020-07-10 | 1 | -0/+25 |
| * | | wip: contrib, GH59 | Martin Czygan | 2020-07-10 | 1 | -16/+22 |
| * | | datacite: address duplicated contributor issue | Martin Czygan | 2020-07-07 | 1 | -0/+16 |
* | | | datacite: mitigate sentry #44035 | Martin Czygan | 2020-07-10 | 1 | -0/+4 |
| |/
|/| |
|
* | | datacite: fix attribute error | Martin Czygan | 2020-07-07 | 1 | -1/+1 |
* | | lint (flake8) tool python files | Bryan Newbold | 2020-07-01 | 16 | -72/+27 |
|/ |
|
* | add new license mappings | Bryan Newbold | 2020-06-30 | 2 | -0/+27 |
* | datacite: improve license mapping | Martin Czygan | 2020-06-30 | 1 | -9/+15 |
* | datacite: hard cast possible date value to string | Martin Czygan | 2020-06-29 | 1 | -1/+1 |
* | ingest importer: check that stage is consistent with release | Bryan Newbold | 2020-05-26 | 1 | -0/+5 |
* | Merge remote-tracking branch 'github/master' | Bryan Newbold | 2020-05-22 | 1 | -2/+2 |
|\ |
|
| * | Indentity is not the same this as equality in Python | Christian Clauss | 2020-05-14 | 1 | -2/+2 |
* | | importers: clarify handling of ApiException | Bryan Newbold | 2020-05-22 | 3 | -4/+10 |
* | | ingest importer: don't use glutton matches | Bryan Newbold | 2020-05-22 | 1 | -3/+3 |
* | | datacite: fix type error | Martin Czygan | 2020-04-22 | 1 | -1/+3 |
* | | datacite: fix a raw name constraint violation | Martin Czygan | 2020-04-20 | 1 | -0/+8 |
|/ |
|
* | consistently use raw string prefix for regex | Bryan Newbold | 2020-04-17 | 1 | -1/+1 |
* | pubmed: use untranslated title if translated not available | Bryan Newbold | 2020-04-01 | 1 | -0/+6 |
* | importers: replace newlines in get_text() strings | Bryan Newbold | 2020-04-01 | 4 | -23/+25 |
* | importers: more string/get_text swaps | Bryan Newbold | 2020-03-28 | 3 | -27/+27 |
* | pubmed: bunch of .get_text() instead of .string | Bryan Newbold | 2020-03-28 | 1 | -12/+12 |
* | Merge pull request #53 from EdwardBetts/spelling | bnewbold | 2020-03-27 | 3 | -7/+7 |
|\ |
|
| * | Correct spelling mistakes | Edward Betts | 2020-03-27 | 3 | -7/+7 |
* | | datacite: nameIdentifier corner case | Bryan Newbold | 2020-03-26 | 1 | -1/+2 |
* | | jalc: avoid meaningless pages values | Bryan Newbold | 2020-03-23 | 1 | -4/+8 |
* | | datacite: add year sanity restrictions | bnewbold | 2020-03-23 | 1 | -0/+7 |
* | | pubmed: handle multiple ReferenceList | Bryan Newbold | 2020-03-20 | 1 | -1/+4 |
* | | pubmed: update many more metadata fields | Bryan Newbold | 2020-03-19 | 1 | -0/+22 |
* | | crossref: skip stub OUP title | Bryan Newbold | 2020-03-19 | 1 | -0/+8 |
* | | Merge branch 'martin-kafka-bs4-import' into 'master' | Martin Czygan | 2020-03-10 | 2 | -1/+66 |
|\ \
| |/
|/| |
|
| * | common: use smaller batch size since XML parsing may be slow | Martin Czygan | 2020-03-10 | 1 | -1/+1 |
| * | pubmed ftp harvest and KafkaBs4XmlPusher | Martin Czygan | 2020-02-19 | 2 | -1/+66 |
* | | add some more domain/rel URL mappings | Bryan Newbold | 2020-02-22 | 1 | -0/+9 |
* | | Merge branch 'bnewbold-shadow-import' | Bryan Newbold | 2020-02-19 | 2 | -0/+196 |
|\ \ |
|
| * | | remove arabesque short wayback URL hack | Bryan Newbold | 2020-02-14 | 1 | -6/+0 |
| * | | improve shadow import file url cleanup path | Bryan Newbold | 2020-02-13 | 1 | -2/+12 |
| * | | shadow import fixes from QA testing | Bryan Newbold | 2020-02-13 | 1 | -0/+6 |
| * | | shadow import: more filtering of file_meta fields | Bryan Newbold | 2020-02-13 | 1 | -0/+10 |