| Commit message (Expand) | Author | Age | Files | Lines |
* | fileset ingest: handle missing/partial file-level metadata | Bryan Newbold | 2022-04-05 | 1 | -3/+3 |
* | ingest importer: improved extra/edit_extra code flow | Bryan Newbold | 2022-04-05 | 1 | -20/+13 |
* | fileset ingest: remove a TODO | Bryan Newbold | 2022-04-04 | 1 | -1/+0 |
* | filesets: typo bugfix, and test 'mimetype' on entity, not extra | Bryan Newbold | 2022-04-04 | 1 | -1/+1 |
* | fileset ingest: fix mimetype handling | Bryan Newbold | 2022-03-31 | 1 | -4/+5 |
* | bugfix: logic flow in fileset release checking | Bryan Newbold | 2022-03-23 | 1 | -3/+6 |
* | single-file variant of fileset importer for dataset attempts | Bryan Newbold | 2022-03-23 | 2 | -0/+202 |
* | fix typo in fileset comparison helper | Bryan Newbold | 2022-03-23 | 1 | -1/+1 |
* | ingest fileset fixes, and some test coverage | Bryan Newbold | 2022-03-23 | 2 | -13/+30 |
* | dataset ingest: JSON object fixes | Bryan Newbold | 2022-03-22 | 1 | -5/+5 |
* | datacite importer: skip container_id for some repository sources | Bryan Newbold | 2022-02-09 | 1 | -0/+34 |
* | doaj importer: TODO note to skip some larger publishers | Bryan Newbold | 2022-02-09 | 1 | -0/+4 |
* | crossref importer: skip affiliations lacking 'name' | Bryan Newbold | 2021-12-15 | 1 | -0/+3 |
* | chocula importer: handle not-upper-case ISSNs | Bryan Newbold | 2021-11-30 | 1 | -2/+6 |
* | chocula importer: handle broken ISSNs in extra metadata | Bryan Newbold | 2021-11-30 | 1 | -2/+7 |
* | chocula importer: tweak counting, conditions for doing updates | Bryan Newbold | 2021-11-30 | 1 | -15/+7 |
* | chocula importer: move issne/issnp 'extra' to top-level fields if doing updates | Bryan Newbold | 2021-11-30 | 1 | -0/+6 |
* | chocula: don't do name cleanups in importer | Bryan Newbold | 2021-11-30 | 1 | -8/+2 |
* | codespell fixes in python code (comments) | Bryan Newbold | 2021-11-24 | 1 | -2/+2 |
* | Merge branch 'bnewbold-import-refactors' into 'master' | bnewbold | 2021-11-11 | 16 | -1380/+146 |
|\ |
|
| * | refactor importer metadata tables into separate file; move some helpers around | Bryan Newbold | 2021-11-10 | 8 | -621/+25 |
| * | importers: refactor imports of clean() and other normalization helpers | Bryan Newbold | 2021-11-10 | 12 | -95/+104 |
| * | remove cdl_dash_dat and wayback_static importers | Bryan Newbold | 2021-11-10 | 3 | -510/+0 |
| * | datacite import: store less subject metadata | Bryan Newbold | 2021-11-10 | 1 | -1/+7 |
| * | importers: use clean_doi() in many more (all?) importers | Bryan Newbold | 2021-11-09 | 6 | -12/+29 |
| * | remove deprecated extid sqlite3 lookup table feature from importers | Bryan Newbold | 2021-11-09 | 3 | -160/+0 |
* | | Merge branch 'bnewbold-cleanups-nov2021' into 'master' | bnewbold | 2021-11-11 | 1 | -0/+9 |
|\ \ |
|
| * | | imports: generic file cleanup removes exact duplicate URLs | Bryan Newbold | 2021-11-09 | 1 | -0/+9 |
| |/ |
|
* / | pubmed: allow updates if PMCID does not exist yet | Bryan Newbold | 2021-11-10 | 1 | -1/+6 |
|/ |
|
* | datacite importer: remove unused 'year_only' variable | Bryan Newbold | 2021-11-03 | 1 | -2/+3 |
* | datacite: add comment about potential date parsing bug | Bryan Newbold | 2021-11-03 | 1 | -0/+1 |
* | datacite importer: dateparser.date.DateDataParser() | Bryan Newbold | 2021-11-03 | 1 | -1/+1 |
* | more involved type wrangling and fixes for importers | Bryan Newbold | 2021-11-03 | 3 | -12/+14 |
* | typing: relatively simple type check fixes | Bryan Newbold | 2021-11-03 | 14 | -87/+82 |
* | typing: initial annotations on importers | Bryan Newbold | 2021-11-03 | 22 | -274/+443 |
* | importers: remove unused __main__ routine | Bryan Newbold | 2021-11-03 | 4 | -19/+0 |
* | lint: resolve existing mypy type errors | Bryan Newbold | 2021-11-02 | 3 | -22/+27 |
* | re-fix some lint issues after big 'fmt' | Bryan Newbold | 2021-11-02 | 1 | -2/+2 |
* | fmt (black): fatcat_tools/ | Bryan Newbold | 2021-11-02 | 22 | -2115/+2578 |
* | python: isort everything | Bryan Newbold | 2021-11-02 | 17 | -41/+70 |
* | arabesque import 'hit' field is 1/0, not true/false | Bryan Newbold | 2021-11-02 | 1 | -2/+2 |
* | lint: simple, safe inline lint fixes | Bryan Newbold | 2021-11-02 | 12 | -22/+21 |
* | lint/fmt: remove all 'import *' | Bryan Newbold | 2021-11-02 | 5 | -21/+41 |
* | re-fmt all the fatcat_tools __init__ files for readability | Bryan Newbold | 2021-11-02 | 1 | -17/+39 |
* | small python tweaks for annotations, imports | Bryan Newbold | 2021-11-02 | 2 | -2/+6 |
* | try some type annotations | Bryan Newbold | 2021-11-02 | 2 | -55/+63 |
* | fix missing variable in fileset ingest | Bryan Newbold | 2021-11-02 | 1 | -2/+1 |
* | WIP: more fileset ingest | Bryan Newbold | 2021-10-18 | 1 | -13/+21 |
* | WIP: rel fixes | Bryan Newbold | 2021-10-14 | 1 | -6/+6 |
* | fileset ingest small tweaks | Bryan Newbold | 2021-10-14 | 1 | -21/+36 |