summaryrefslogtreecommitdiffstats
path: root/python/fatcat_tools/workers/changelog.py
Commit message (Expand)AuthorAgeFilesLines
* typing: add assertions to fatcat_tool code to make type assumptions explicitBryan Newbold2021-11-031-0/+1
* typing: add annotations to remaining fatcat_tools codeBryan Newbold2021-11-031-17/+26
* fmt (black): fatcat_tools/Bryan Newbold2021-11-021-118/+138
* python: isort everythingBryan Newbold2021-11-021-1/+2
* changelog worker: fix file/fileset typo, caught by lintBryan Newbold2021-05-251-1/+1
* entity update worker: treat fileset and webcapture updates like file updatesBryan Newbold2020-12-161-3/+25
* entity updates: don't ingest JSTOR DOI prefixesBryan Newbold2020-10-231-0/+2
* entity updater: new work update feed (ident and changelog metadata only)Bryan Newbold2020-10-161-2/+24
* ingest: default to crawl protocols.io DOIsBryan Newbold2020-09-101-0/+2
* entity updater: handle doi=None case betterBryan Newbold2020-08-141-1/+1
* entity updater: es['publisher_type'] not always setBryan Newbold2020-08-141-1/+1
* entity update: change big5 ingest behaviorBryan Newbold2020-08-111-9/+15
* entity update: default to ingest non-OA worksBryan Newbold2020-08-111-9/+10
* entity update: skip ingest of figshare+zenodo 'group' DOIsBryan Newbold2020-08-111-0/+15
* update crawl blocklist for SPNv2 requests which mostly failBryan Newbold2020-08-101-2/+10
* lint (flake8) tool python filesBryan Newbold2020-07-011-1/+0
* changelog: limit typesMartin Czygan2020-04-161-5/+1
* changelog: extend release_types considered documentsMartin Czygan2020-04-161-10/+19
* ingest: more DOI patterns to treat as OABryan Newbold2020-03-281-0/+26
* ingest: always try some lancet journalsBryan Newbold2020-03-191-0/+3
* entity worker: ingest more releasesBryan Newbold2020-02-221-1/+37
* always crawl researchgate DOIsBryan Newbold2020-02-181-0/+2
* add acceptlist override for biorxiv/medrxivBryan Newbold2020-02-101-2/+12
* fix KafkaError worker reporting for partition errorsBryan Newbold2020-01-291-1/+1
* additional DOI prefix filtersBryan Newbold2020-01-281-0/+8
* apply ingest request filtering in entity workerBryan Newbold2020-01-281-3/+34
* update ingest request schemaBryan Newbold2019-12-131-1/+1
* project -> ingest_request_sourceBryan Newbold2019-11-151-1/+1
* add ingest request feature to entity_updates workerBryan Newbold2019-11-151-4/+20
* review/fix all confluent-kafka produce codeBryan Newbold2019-09-201-4/+12
* small fixes to confluent-kafka importers/workersBryan Newbold2019-09-201-4/+10
* convert pipeline workers from pykafka to confluent-kafkaBryan Newbold2019-09-201-67/+116
* fix typo in typoBryan Newbold2019-06-241-1/+1
* fix typo in changelog workerBryan Newbold2019-06-241-1/+1
* more links on new homepageBryan Newbold2019-06-191-1/+1
* fix and workaround container entities in release topicBryan Newbold2019-05-301-2/+2
* file and container update kafka topicsBryan Newbold2019-05-301-54/+69
* update elastic for releases when files addedBryan Newbold2019-05-301-1/+36
* 10 MByte default Kafka produce (workers)Bryan Newbold2019-03-061-2/+6
* bunch of lint/whitespace cleanupsBryan Newbold2019-02-221-2/+1
* include filesets and webcaptures in exportsBryan Newbold2019-01-181-1/+1
* workers do API-passing (not URI-passing)Bryan Newbold2019-01-081-4/+4
* not as strong a todo (timestamps)Bryan Newbold2018-11-191-1/+1
* bunch of pylint cleanupBryan Newbold2018-11-151-1/+1
* large refactor of python names/pathsBryan Newbold2018-11-151-3/+4
* fix worker codeBryan Newbold2018-11-141-2/+3
* most_recent_message as reusable functionBryan Newbold2018-11-141-26/+1
* switch to auto consumer offset updatesBryan Newbold2018-11-131-1/+6
* shuffle around fatcat_tools layoutBryan Newbold2018-11-131-0/+122