aboutsummaryrefslogtreecommitdiffstats
path: root/python/fatcat_tools/transforms
Commit message (Expand)AuthorAgeFilesLines
* Merge branch 'bnewbold-import-fileset'Bryan Newbold2021-11-021-1/+15
|\
| * ingest: handle datasets, components, other ingest typesBryan Newbold2021-10-141-1/+15
* | access: populate thumbnail_url for PDFsBryan Newbold2021-10-181-3/+9
|/
* python: implement ES schema changesBryan Newbold2021-10-131-4/+17
* refs: generalize web endpoints; JSON content negotiation; openlibrary inbound...Bryan Newbold2021-07-231-0/+2
* remove unused imports (lint)Bryan Newbold2021-07-231-1/+1
* partial access options transform for releasesBryan Newbold2021-07-231-0/+58
* more consistent and defensive lower-casing of DOIsBryan Newbold2021-06-231-2/+2
* small python lint fixes (no behavior change)Bryan Newbold2021-05-251-1/+1
* ingest: add per-container ingest type overridesBryan Newbold2021-05-211-1/+17
* transforms: fix 'display_ame' typoBryan Newbold2021-04-191-2/+2
* prefer contrib.creator.display_name over contrib.raw_nameBryan Newbold2021-04-122-4/+7
* ES schema updates: doc_index_ts as a str, not datetimeBryan Newbold2021-04-061-4/+4
* container search schema: preservation stats, new fieldsBryan Newbold2021-04-061-2/+18
* release ES: add discipline fieldBryan Newbold2021-04-061-0/+2
* ES schemas: add doc_index_ts to all mappingsBryan Newbold2021-04-061-0/+4
* elasticsearch: simple new dblp and doaj fieldsBryan Newbold2021-01-201-0/+4
* bug fix: is_preserved should always be boolBryan Newbold2020-12-171-2/+2
* fix indentationBryan Newbold2020-12-161-2/+2
* have release elasticsearch transform count webcaptures and filesets towards p...Bryan Newbold2020-12-161-26/+57
* small release_to_elasticsearch refactorsBryan Newbold2020-12-161-7/+12
* refactor release_to_elasticsearch transformBryan Newbold2020-12-161-131/+148
* if a release has DOAJ article id, count as OABryan Newbold2020-11-191-0/+3
* ingest tool: support for setting ingest typeBryan Newbold2020-11-061-6/+6
* elastic transform: more preservation keepersBryan Newbold2020-10-081-1/+2
* release ES transform tweaksBryan Newbold2020-08-071-3/+5
* basic toml transform helperBryan Newbold2020-07-302-4/+20
* simplify in_kbart check statementBryan Newbold2020-07-231-1/+1
* make in_kbart transform inclusive of last yearBryan Newbold2020-07-231-0/+9
* lint (flake8) tool python filesBryan Newbold2020-07-014-18/+10
* ES schema: add best_url to file schemaBryan Newbold2020-06-041-0/+12
* improve citeproc/CSL web interfaceBryan Newbold2020-03-251-6/+12
* Merge branch 'bnewbold-elastic-v03b'Bryan Newbold2020-02-262-46/+198
|\
| * improve is_oa flag accuracyBryan Newbold2020-02-261-8/+4
| * ES container last tweaksBryan Newbold2020-02-261-0/+3
| * ES release: last minor tweaksBryan Newbold2020-02-261-2/+2
| * ES files: don't remove archive.org domains/hostsBryan Newbold2020-02-071-5/+0
| * ES releases: host/domain fixesBryan Newbold2020-01-311-2/+2
| * fix release es transform missing 'issue'Bryan Newbold2020-01-301-0/+1
| * add upper-case work-around from kibana map joinBryan Newbold2020-01-301-0/+1
| * tweak file ES archive.org domain trackingBryan Newbold2020-01-301-0/+6
| * implement host+domain parsing for file ES transformBryan Newbold2020-01-301-9/+5
| * fix ES file schema plural field namesBryan Newbold2020-01-291-4/+3
| * elastic schema fixesBryan Newbold2020-01-291-0/+5
| * add country to v03b release schemaBryan Newbold2020-01-291-0/+2
| * actually implement changelog transformBryan Newbold2020-01-291-17/+45
| * fix some transform bugs, add some testsBryan Newbold2020-01-291-6/+8
| * ES release schema updatesBryan Newbold2020-01-291-5/+76
| * container ES schema changesBryan Newbold2020-01-291-16/+18
| * first implementation of ES file schemaBryan Newbold2020-01-292-1/+46