path: root/python
Commit message | Author | Age | Files | Lines
* small changes to preservation coverage search queries | Bryan Newbold | 2022-02-03 | 2 | -10/+67
|   - allow fetching of by-release-type preservation histograms as JSON
|   - query flag to exclude 'stub' entity types
|   - don't include 'stub' entities in container-by-year or container-by-volume charts (and JSON)
* pipenv: update deps | Bryan Newbold | 2022-02-03 | 1 | -475/+458
|
* pipenv: black (code style tool) has a stable release | Bryan Newbold | 2022-02-03 | 1 | -3/+1
|
* container ES transform: include old extra.issne/p fields | Bryan Newbold | 2022-02-03 | 1 | -1/+4
|   These were removed prematurely. Not all containers have been updated to use these fields yet.
* web: update front-page stats | Bryan Newbold | 2022-02-01 | 1 | -3/+3
|
* Merge branch 'bnewbold-read-only' into 'master' | bnewbold | 2022-01-25 | 2 | -1/+16
|\
| |   read-only database mode
| |
| |   See merge request webgroup/fatcat!137
| * web: support for alert messages (site-wide) | Bryan Newbold | 2022-01-21 | 2 | -1/+16
| |
* | ingest tool: clear_scroll allowed in es-public-proxy for some time | Bryan Newbold | 2022-01-21 | 1 | -8/+0
|/
* Merge branch 'bnewbold-file-es' into 'master' | bnewbold | 2022-01-21 | 4 | -4/+66
|\
| |   File entity elasticsearch index worker
| |
| |   See merge request webgroup/fatcat!136
| * entity worker: expand creators in release entities | Bryan Newbold | 2021-12-15 | 1 | -1/+1
| |
| * small default config typo fixes for elasticsearch workers | Bryan Newbold | 2021-12-15 | 1 | -2/+2
| |
| * file elasticsearch index worker | Bryan Newbold | 2021-12-15 | 3 | -1/+63
| |
* | elasticsearch: bump timeout to 40 seconds (from default of 10) | Bryan Newbold | 2022-01-10 | 1 | -1/+1
| |
* | make fmt | Bryan Newbold | 2021-12-15 | 2 | -5/+6
| |
* | Merge branch 'martin-sentry-sdk' into 'master' | bnewbold | 2021-12-16 | 10 | -344/+396
|\ \
| | |   move from raven to sentry_sdk
| | |
| | |   See merge request webgroup/fatcat!135
| * | move from raven to sentry_sdk | Martin Czygan | 2021-12-14 | 10 | -344/+396
| |/
| |   related docs:
| |
| |   * https://docs.sentry.io/platforms/python/guides/flask/migration/
| |   * https://docs.sentry.io/platforms/python/guides/asgi/configuration/integrations/flask/
| |
| |   > `fetch_git_sha` is gone, see: https://forum.sentry.io/t/fetch-git-sha-equivalent-in-the-unified-python-sdk/5521
* / crossref importer: skip affiliations lacking 'name' | Bryan Newbold | 2021-12-15 | 1 | -0/+3
|/
|   Relatedly, we should start handling ROR affiliations in contribs soon.
* mergers: fix typo in env var name | Bryan Newbold | 2021-12-07 | 3 | -3/+3
|
* ES container schema: add 'sim_pubid' and `ia_sim_collection` fields | Bryan Newbold | 2021-12-03 | 1 | -0/+2
|
* ES transform: remove prototype microfilm links | Bryan Newbold | 2021-12-03 | 1 | -20/+0
|   This ended up being a feature in scholar.archive.org, not fatcat.
* chocula importer: handle not-upper-case ISSNs | Bryan Newbold | 2021-11-30 | 1 | -2/+6
|
* chocula importer: handle broken ISSNs in extra metadata | Bryan Newbold | 2021-11-30 | 1 | -2/+7
|
* chocula importer: tweak counting, conditions for doing updates | Bryan Newbold | 2021-11-30 | 1 | -15/+7
|
* chocula importer: move issne/issnp 'extra' to top-level fields if doing updates | Bryan Newbold | 2021-11-30 | 1 | -0/+6
|
* chocula: don't do name cleanups in importer | Bryan Newbold | 2021-11-30 | 1 | -8/+2
|   This kind of cleanup should be done in 'chocula' instead.
* container merger: fix bug with filtering by release count | Bryan Newbold | 2021-11-30 | 1 | -13/+15
|   Also apply the "human edit" and "release count" checks only to the dupe (to-be-redirected) idents.
* release merger: same editgroup_id fixes as for file and container mergers | Bryan Newbold | 2021-11-24 | 1 | -1/+5
|
* container merger: fixes from QA testing | Bryan Newbold | 2021-11-24 | 1 | -8/+13
|
* mergers: don't try to accept empty editgroups in dry-run-mode | Bryan Newbold | 2021-11-24 | 1 | -2/+4
|
* ES release transform: handle redirected containers better | Bryan Newbold | 2021-11-24 | 1 | -1/+1
|   Despite the inline comment, we were not actually grabbing the "redirected" ident correctly, meaning some counts would not be accurate.
* container merger: defer allocation of editgroup_id; and dummy code path | Bryan Newbold | 2021-11-24 | 1 | -1/+5
|
* initial implementation of container merger | Bryan Newbold | 2021-11-24 | 2 | -0/+353
|
* file merger: allocate editgroup id later in 'merge' process | Bryan Newbold | 2021-11-24 | 1 | -1/+5
|   The motivation is to avoid creating empty editgroups in dry-run mode, and when all entities are "skipped"
* Merge branch 'bnewbold-mergers' into 'master' | bnewbold | 2021-11-25 | 5 | -0/+800
|\
| |   entity mergers framework
| |
| |   See merge request webgroup/fatcat!133
| * mergers common: remove inaccurate comment | Bryan Newbold | 2021-11-24 | 1 | -2/+0
| |   Caught in review, thanks miku
| * file merger: add content_scope to list of merged fields | Bryan Newbold | 2021-11-24 | 1 | -1/+1
| |
| * release merger: some progress, but also disable (not complete) | Bryan Newbold | 2021-11-23 | 1 | -12/+72
| |
| * file merges: fixes from testing in QA | Bryan Newbold | 2021-11-23 | 1 | -14/+23
| |
| * mergers: small tweaks | Bryan Newbold | 2021-11-23 | 2 | -3/+3
| |
| * mergers: remove entity mergers from __init__ (to work around warning) | Bryan Newbold | 2021-11-23 | 1 | -2/+0
| |
| * initial file merger, with tests | Bryan Newbold | 2021-11-23 | 2 | -0/+388
| |
| * mergers: fmt, lint, refactors | Bryan Newbold | 2021-11-23 | 3 | -96/+200
| |   This old merger code is from an old branch and needed significant cleanup
| * remove top-level fatcat_merge.py; going to call module __main__ going forward | Bryan Newbold | 2021-11-23 | 1 | -112/+0
| |
| * first iteration of mergers | Bryan Newbold | 2021-11-23 | 4 | -0/+355
| |
* | codespell fixes to various other docs | Bryan Newbold | 2021-11-24 | 1 | -1/+1
| |
* | codespell fixes in python code (comments) | Bryan Newbold | 2021-11-24 | 4 | -6/+6
| |
* | codespell fixes in web interface templates | Bryan Newbold | 2021-11-24 | 14 | -19/+19
|/
* Merge branch 'bnewbold-content-scope' | Bryan Newbold | 2021-11-22 | 5 | -1/+8
|\
| * bump python client to 0.5.0 | Bryan Newbold | 2021-11-17 | 1 | -1/+1
| |
| * content_scope: include in file ES schema and transform | Bryan Newbold | 2021-11-17 | 1 | -0/+1
| |