path: root/python/fatcat_tools/transforms
Commit message; Author; Age; Files; Lines
* ES schema updates: doc_index_ts as a str, not datetime; Bryan Newbold; 2021-04-06; 1; -4/+4
* container search schema: preservation stats, new fields; Bryan Newbold; 2021-04-06; 1; -2/+18
* release ES: add discipline field; Bryan Newbold; 2021-04-06; 1; -0/+2
* ES schemas: add doc_index_ts to all mappings; Bryan Newbold; 2021-04-06; 1; -0/+4
* elasticsearch: simple new dblp and doaj fields; Bryan Newbold; 2021-01-20; 1; -0/+4
* bug fix: is_preserved should always be bool; Bryan Newbold; 2020-12-17; 1; -2/+2
* fix indentation; Bryan Newbold; 2020-12-16; 1; -2/+2
* have release elasticsearch transform count webcaptures and filesets towards p...; Bryan Newbold; 2020-12-16; 1; -26/+57
* small release_to_elasticsearch refactors; Bryan Newbold; 2020-12-16; 1; -7/+12
* refactor release_to_elasticsearch transform; Bryan Newbold; 2020-12-16; 1; -131/+148
* if a release has DOAJ article id, count as OA; Bryan Newbold; 2020-11-19; 1; -0/+3
* ingest tool: support for setting ingest type; Bryan Newbold; 2020-11-06; 1; -6/+6
* elastic transform: more preservation keepers; Bryan Newbold; 2020-10-08; 1; -1/+2
* release ES transform tweaks; Bryan Newbold; 2020-08-07; 1; -3/+5
* basic toml transform helper; Bryan Newbold; 2020-07-30; 2; -4/+20
* simplify in_kbart check statement; Bryan Newbold; 2020-07-23; 1; -1/+1
* make in_kbart transform inclusive of last year; Bryan Newbold; 2020-07-23; 1; -0/+9
* lint (flake8) tool python files; Bryan Newbold; 2020-07-01; 4; -18/+10
* ES schema: add best_url to file schema; Bryan Newbold; 2020-06-04; 1; -0/+12
* improve citeproc/CSL web interface; Bryan Newbold; 2020-03-25; 1; -6/+12
* Merge branch 'bnewbold-elastic-v03b'; Bryan Newbold; 2020-02-26; 2; -46/+198
|\
| * improve is_oa flag accuracy; Bryan Newbold; 2020-02-26; 1; -8/+4
| * ES container last tweaks; Bryan Newbold; 2020-02-26; 1; -0/+3
| * ES release: last minor tweaks; Bryan Newbold; 2020-02-26; 1; -2/+2
| * ES files: don't remove archive.org domains/hosts; Bryan Newbold; 2020-02-07; 1; -5/+0
| * ES releases: host/domain fixes; Bryan Newbold; 2020-01-31; 1; -2/+2
| * fix release es transform missing 'issue'; Bryan Newbold; 2020-01-30; 1; -0/+1
| * add upper-case work-around from kibana map join; Bryan Newbold; 2020-01-30; 1; -0/+1
| * tweak file ES archive.org domain tracking; Bryan Newbold; 2020-01-30; 1; -0/+6
| * implement host+domain parsing for file ES transform; Bryan Newbold; 2020-01-30; 1; -9/+5
| * fix ES file schema plural field names; Bryan Newbold; 2020-01-29; 1; -4/+3
| * elastic schema fixes; Bryan Newbold; 2020-01-29; 1; -0/+5
| * add country to v03b release schema; Bryan Newbold; 2020-01-29; 1; -0/+2
| * actually implement changelog transform; Bryan Newbold; 2020-01-29; 1; -17/+45
| * fix some transform bugs, add some tests; Bryan Newbold; 2020-01-29; 1; -6/+8
| * ES release schema updates; Bryan Newbold; 2020-01-29; 1; -5/+76
| * container ES schema changes; Bryan Newbold; 2020-01-29; 1; -16/+18
| * first implementation of ES file schema; Bryan Newbold; 2020-01-29; 2; -1/+46
* | default to PMC ingest URLs over DOI; Bryan Newbold; 2020-02-04; 1; -4/+4
|/
* remove 'oa_only' feature from ingest transform; Bryan Newbold; 2020-01-28; 1; -14/+1
* transform ingests via pmc/pmcid, not pubmed/pmid; Bryan Newbold; 2019-12-24; 1; -4/+4
* update ingest request schema; Bryan Newbold; 2019-12-13; 1; -5/+22
* tweaks to ingest-file transform; Bryan Newbold; 2019-12-12; 1; -13/+7
* project -> ingest_request_source; Bryan Newbold; 2019-11-15; 1; -2/+2
* fix release.pmcid typo; Bryan Newbold; 2019-11-15; 1; -2/+2
* more ingest importer comments and counts; Bryan Newbold; 2019-11-15; 1; -1/+1
* add ingest request transform (and test); Bryan Newbold; 2019-11-15; 2; -0/+67
* dict wrapper for entity_from_json(); Bryan Newbold; 2019-10-08; 2; -3/+7
* refactor all python source for client lib name; Bryan Newbold; 2019-09-05; 3; -3/+3
* comment clarifying container.ident in ES release transform; Bryan Newbold; 2019-09-03; 1; -0/+2