| Commit message | Author | Age | Files | Lines |
... | |
* | remove 'import *' from fatcat_tools (for transforms) | Bryan Newbold | 2021-11-02 | 1 | -2/+2 |
* | small python tweaks for annotations, imports | Bryan Newbold | 2021-11-02 | 3 | -3/+7 |
* | try some type annotations | Bryan Newbold | 2021-11-02 | 4 | -70/+79 |
* | reviewer: add annotations required by mypy | Bryan Newbold | 2021-11-02 | 1 | -2/+3 |
* | fix missing variable in fileset ingest | Bryan Newbold | 2021-11-02 | 1 | -2/+1 |
* | Merge branch 'bnewbold-import-fileset' | Bryan Newbold | 2021-11-02 | 5 | -4/+350 |
|\ |
| * | WIP: more fileset ingest | Bryan Newbold | 2021-10-18 | 1 | -13/+21 |
| * | WIP: rel fixes | Bryan Newbold | 2021-10-14 | 1 | -6/+6 |
| * | fileset ingest small tweaks | Bryan Newbold | 2021-10-14 | 1 | -21/+36 |
| * | initial implementation of fileset ingest importers | Bryan Newbold | 2021-10-14 | 2 | -3/+224 |
| * | ingest: handle datasets, components, other ingest types | Bryan Newbold | 2021-10-14 | 1 | -1/+15 |
| * | generic fileset importer class, with test coverage | Bryan Newbold | 2021-10-14 | 3 | -0/+88 |
* | | Merge branch 'bnewbold-match-get' | Bryan Newbold | 2021-11-02 | 1 | -3/+9 |
|\ \ |
| * | | access: populate thumbnail_url for PDFs | Bryan Newbold | 2021-10-18 | 1 | -3/+9 |
| |/ |
* / | pubmed: switch default http site to retrieve update files | Martin Czygan | 2021-10-15 | 1 | -2/+4 |
|/ |
* | dblp import: basic support for handles as identifiers | Bryan Newbold | 2021-10-13 | 1 | -1/+5 |
* | python: normalization/validation support for handle identifiers (hdl) | Bryan Newbold | 2021-10-13 | 1 | -0/+33 |
* | dblp import: fix typos in identifier parsing | Bryan Newbold | 2021-10-13 | 1 | -2/+1 |
* | python: partial importer utilization of new schema changes | Bryan Newbold | 2021-10-13 | 3 | -6/+18 |
* | python: implement ES schema changes | Bryan Newbold | 2021-10-13 | 1 | -4/+17 |
* | Merge branch 'bnewbold-ingest-tweaks' into 'master' | bnewbold | 2021-10-02 | 3 | -39/+106 |
|\ |
| * | kafka import: optional 'force-flush' mode for some importers | Bryan Newbold | 2021-10-01 | 1 | -0/+13 |
| * | new SPN web (html) importer | Bryan Newbold | 2021-10-01 | 2 | -27/+81 |
| * | ingest importer behavior tweaks | Bryan Newbold | 2021-10-01 | 1 | -8/+8 |
| * | importer common: more verbose logging (with counts) | Bryan Newbold | 2021-10-01 | 1 | -4/+4 |
* | | datacite: skip empty abstracts | Martin Czygan | 2021-10-01 | 1 | -1/+4 |
|/ |
* | pubmed: workaround a networking issue | Martin Czygan | 2021-09-09 | 1 | -24/+21 |
* | pubmed: add option to ftp download with lftp | Martin Czygan | 2021-09-08 | 1 | -2/+31 |
* | pubmed harvester: add basic retry logic | Martin Czygan | 2021-08-20 | 1 | -8/+21 |
* | refs: default to *not* consolidating works | Bryan Newbold | 2021-08-06 | 1 | -1/+1 |
* | refs: lint fixes | Bryan Newbold | 2021-07-27 | 1 | -0/+1 |
* | refs: support for wikipedia outbound refs, and display in tables | Bryan Newbold | 2021-07-27 | 1 | -2/+2 |
* | refs: generalize web endpoints; JSON content negotiation; openlibrary inbound... | Bryan Newbold | 2021-07-23 | 2 | -22/+57 |
* | refs: small refactors/tweaks | Bryan Newbold | 2021-07-23 | 1 | -11/+17 |
* | remove unused imports (lint) | Bryan Newbold | 2021-07-23 | 2 | -3/+2 |
* | pylint: skip pydantic import check (dynamic/extensions) | Bryan Newbold | 2021-07-23 | 1 | -8/+2 |
* | refs: refactor web paths; enrich refs as generic; remove old refs link | Bryan Newbold | 2021-07-23 | 1 | -50/+35 |
* | refs fetch: add some hacks; sort hits | Bryan Newbold | 2021-07-23 | 1 | -6/+16 |
* | fixes for newer ref index | Bryan Newbold | 2021-07-23 | 1 | -1/+1 |
* | references: refactor to point to access_options transform; comment out CSL fi... | Bryan Newbold | 2021-07-23 | 1 | -57/+8 |
* | partial access options transform for releases | Bryan Newbold | 2021-07-23 | 1 | -0/+58 |
* | initial inbound/outbound reference query helpers | Bryan Newbold | 2021-07-23 | 1 | -0/+450 |
* | pubmed: update docs | Martin Czygan | 2021-07-17 | 1 | -2/+3 |
* | pubmed: do not fail when accessing missing file | Martin Czygan | 2021-07-17 | 1 | -2/+8 |
* | pubmed: reconnect on error | Martin Czygan | 2021-07-16 | 1 | -4/+30 |
* | more consistent and defensive lower-casing of DOIs | Bryan Newbold | 2021-06-23 | 3 | -3/+8 |
* | datacite: more careful title string access; fixes sentry #88350 | Martin Czygan | 2021-06-11 | 1 | -1/+1 |
* | clean_doi() should lower-case returned DOI | Bryan Newbold | 2021-06-07 | 1 | -1/+4 |
* | ingest: swap ingest and file checks, to result in clearer stats/counts of ski... | Bryan Newbold | 2021-06-03 | 1 | -2/+2 |
* | ingest: don't accept mag and s2 URLs | Bryan Newbold | 2021-06-03 | 1 | -4/+4 |