Commit message (Collapse) | Author | Age | Files | Lines | |
---|---|---|---|---|---|
* | ingest: generate URLs for hdl (handle.net) | Bryan Newbold | 2022-07-19 | 1 | -0/+4 |
| | |||||
* | ingest: DOAJ article URLs | Bryan Newbold | 2022-07-12 | 1 | -0/+4 |
| | |||||
* | typing: first batch of python bulk type annotations | Bryan Newbold | 2021-11-03 | 1 | -2/+10 |
| | | | | | | While these changes are more delicate than simple lint changes, this specific batch of edits and annotations was *relatively* simple, and resulted in few code changes other than function signature additions. | ||||
* | fmt (black): fatcat_tools/ | Bryan Newbold | 2021-11-02 | 1 | -27/+37 |
| | |||||
* | ingest: handle datasets, components, other ingest types | Bryan Newbold | 2021-10-14 | 1 | -1/+15 |
| | |||||
* | more consistent and defensive lower-casing of DOIs | Bryan Newbold | 2021-06-23 | 1 | -2/+2 |
| | | | | | | | After noticing more upper/lower ambiguity in production. In particular, we have some old ingest requests in sandcrawler DB, which get re-submitted/re-tried, which have capitalized DOIs in the link source id field. | ||||
* | ingest: add per-container ingest type overrides | Bryan Newbold | 2021-05-21 | 1 | -1/+17 |
| | |||||
* | ingest tool: support for setting ingest type | Bryan Newbold | 2020-11-06 | 1 | -6/+6 |
| | |||||
* | lint (flake8) tool python files | Bryan Newbold | 2020-07-01 | 1 | -1/+0 |
| | |||||
* | default to PMC ingest URLs over DOI | Bryan Newbold | 2020-02-04 | 1 | -4/+4 |
| | | | | | | | For cases where there might be both PMC and DOI urls, do the europmc.org PMC ones over DOI option. May want to turn this into a config or command-line option in the future. | ||||
* | remove 'oa_only' feature from ingest transform | Bryan Newbold | 2020-01-28 | 1 | -14/+1 |
| | | | | Refactoring to move this filter elsewhere | ||||
* | transform ingests via pmc/pmcid, not pubmed/pmid | Bryan Newbold | 2019-12-24 | 1 | -4/+4 |
| | |||||
* | update ingest request schema | Bryan Newbold | 2019-12-13 | 1 | -5/+22 |
| | | | | | This is mostly changing ingest_type from 'file' to 'pdf', and adding 'link_source'/'link_source_id', plus some small cleanups. | ||||
* | tweaks to ingest-file transform | Bryan Newbold | 2019-12-12 | 1 | -13/+7 |
| | |||||
* | project -> ingest_request_source | Bryan Newbold | 2019-11-15 | 1 | -2/+2 |
| | |||||
* | fix release.pmcid typo | Bryan Newbold | 2019-11-15 | 1 | -2/+2 |
| | |||||
* | more ingest importer comments and counts | Bryan Newbold | 2019-11-15 | 1 | -1/+1 |
| | |||||
* | add ingest request transform (and test) | Bryan Newbold | 2019-11-15 | 1 | -0/+66 |