| Commit message (Expand) | Author | Age | Files | Lines |
... | |
* | datacite: improve license mapping | Martin Czygan | 2020-06-30 | 1 | -9/+15 |
* | datacite: hard cast possible date value to string | Martin Czygan | 2020-06-29 | 1 | -1/+1 |
* | ingest importer: check that stage is consistent with release | Bryan Newbold | 2020-05-26 | 1 | -0/+5 |
* | Merge remote-tracking branch 'github/master' | Bryan Newbold | 2020-05-22 | 1 | -2/+2 |
|\ |
|
| * | Indentity is not the same this as equality in Python | Christian Clauss | 2020-05-14 | 1 | -2/+2 |
* | | importers: clarify handling of ApiException | Bryan Newbold | 2020-05-22 | 3 | -4/+10 |
* | | ingest importer: don't use glutton matches | Bryan Newbold | 2020-05-22 | 1 | -3/+3 |
* | | datacite: fix type error | Martin Czygan | 2020-04-22 | 1 | -1/+3 |
* | | datacite: fix a raw name constraint violation | Martin Czygan | 2020-04-20 | 1 | -0/+8 |
|/ |
|
* | consistently use raw string prefix for regex | Bryan Newbold | 2020-04-17 | 1 | -1/+1 |
* | pubmed: use untranslated title if translated not available | Bryan Newbold | 2020-04-01 | 1 | -0/+6 |
* | importers: replace newlines in get_text() strings | Bryan Newbold | 2020-04-01 | 4 | -23/+25 |
* | importers: more string/get_text swaps | Bryan Newbold | 2020-03-28 | 3 | -27/+27 |
* | pubmed: bunch of .get_text() instead of .string | Bryan Newbold | 2020-03-28 | 1 | -12/+12 |
* | Merge pull request #53 from EdwardBetts/spelling | bnewbold | 2020-03-27 | 3 | -7/+7 |
|\ |
|
| * | Correct spelling mistakes | Edward Betts | 2020-03-27 | 3 | -7/+7 |
* | | datacite: nameIdentifier corner case | Bryan Newbold | 2020-03-26 | 1 | -1/+2 |
* | | jalc: avoid meaningless pages values | Bryan Newbold | 2020-03-23 | 1 | -4/+8 |
* | | datacite: add year sanity restrictions | bnewbold | 2020-03-23 | 1 | -0/+7 |
* | | pubmed: handle multiple ReferenceList | Bryan Newbold | 2020-03-20 | 1 | -1/+4 |
* | | pubmed: update many more metadata fields | Bryan Newbold | 2020-03-19 | 1 | -0/+22 |
* | | crossref: skip stub OUP title | Bryan Newbold | 2020-03-19 | 1 | -0/+8 |
* | | Merge branch 'martin-kafka-bs4-import' into 'master' | Martin Czygan | 2020-03-10 | 2 | -1/+66 |
|\ \
| |/
|/| |
|
| * | common: use smaller batch size since XML parsing may be slow | Martin Czygan | 2020-03-10 | 1 | -1/+1 |
| * | pubmed ftp harvest and KafkaBs4XmlPusher | Martin Czygan | 2020-02-19 | 2 | -1/+66 |
* | | add some more domain/rel URL mappings | Bryan Newbold | 2020-02-22 | 1 | -0/+9 |
* | | Merge branch 'bnewbold-shadow-import' | Bryan Newbold | 2020-02-19 | 2 | -0/+196 |
|\ \ |
|
| * | | remove arabesque short wayback URL hack | Bryan Newbold | 2020-02-14 | 1 | -6/+0 |
| * | | improve shadow import file url cleanup path | Bryan Newbold | 2020-02-13 | 1 | -2/+12 |
| * | | shadow import fixes from QA testing | Bryan Newbold | 2020-02-13 | 1 | -0/+6 |
| * | | shadow import: more filtering of file_meta fields | Bryan Newbold | 2020-02-13 | 1 | -0/+10 |
| * | | basic shadow importer | Bryan Newbold | 2020-02-13 | 2 | -0/+176 |
| |/ |
|
* | | ingest import: fix edit_extra path | Bryan Newbold | 2020-02-18 | 1 | -1/+1 |
* | | ingest importer: edit_extra is a top-level key | Bryan Newbold | 2020-02-18 | 1 | -1/+1 |
* | | ingest import: allow short version of corpus names | Bryan Newbold | 2020-02-18 | 1 | -0/+3 |
* | | ingest importer: pass through link rel | Bryan Newbold | 2020-02-18 | 1 | -1/+6 |
|/ |
|
* | check ingest_request_source existance for SPN as well as ingest | Bryan Newbold | 2020-02-06 | 1 | -0/+3 |
* | additional trusted link sources | Bryan Newbold | 2020-02-06 | 1 | -0/+3 |
* | add mag and s2 as trusted link sources | Bryan Newbold | 2020-02-06 | 1 | -1/+1 |
* | ingest worker: handle missing ingest_request_source | Bryan Newbold | 2020-02-06 | 1 | -0/+3 |
* | datacite: prevent none | Martin Czygan | 2020-01-31 | 1 | -1/+1 |
* | datacite: name shall not be None | Martin Czygan | 2020-01-31 | 1 | -1/+1 |
* | datacite: add exception for https://www.micropublication.org/ | Martin Czygan | 2020-01-31 | 1 | -0/+5 |
* | datacite: do not skip records w/o date | Martin Czygan | 2020-01-31 | 1 | -2/+1 |
* | datacite: improve docstring | Martin Czygan | 2020-01-31 | 1 | -4/+4 |
* | datacite: improve date handling and minor tweak | Martin Czygan | 2020-01-30 | 1 | -19/+42 |
* | fix KafkaError worker reporting for partition errors | Bryan Newbold | 2020-01-29 | 1 | -1/+1 |
* | fix trivial typo in file importer | Bryan Newbold | 2020-01-20 | 1 | -1/+1 |
* | ingest: improve tests, support old ingest results | Bryan Newbold | 2020-01-15 | 1 | -3/+12 |
* | update ingest worker for schema tweaks | Bryan Newbold | 2020-01-15 | 1 | -8/+15 |