aboutsummaryrefslogtreecommitdiffstats
Commit message (Collapse)AuthorAgeFilesLines
* more interesting example entities (eg, to crawl)Bryan Newbold2021-05-181-0/+19
|
* elasticsearch ref schema: 6 shards, not 12Bryan Newbold2021-05-181-1/+1
|
* Merge branch 'bnewbold-pipenv-cleanup' into 'master'bnewbold2021-04-232-327/+277
|\ | | | | | | | | pipenv cleanup See merge request webgroup/fatcat!104
| * pipenv: re-lock projectBryan Newbold2021-04-191-301/+253
| |
| * pipenv: constrain most package versions to at least majorBryan Newbold2021-04-191-24/+24
| | | | | | | | | | | | | | | | | | Don't have a complete policy with this change, just locking things down a bit more so small package additions and updates don't end up upgrading some small dependency to a major new backwards-incompatible version. Also, correct bs4 -> beautifulsoup4 (bs4 is the import name, not the package name)
| * pipenv: remove unused pg-view and pykafka librariesBryan Newbold2021-04-191-2/+0
| |
* | web: fix edit form style guide linksBryan Newbold2021-04-202-4/+4
|/
* transforms: fix 'display_ame' typoBryan Newbold2021-04-191-2/+2
|
* web: expand release creators in more situationsBryan Newbold2021-04-192-2/+2
|
* fix public API linksBryan Newbold2021-04-151-2/+2
|
* Merge branch 'bnewbold-ui-tweaks-202104' into 'master'bnewbold2021-04-1318-41/+90
|\ | | | | | | | | Misc UI tweaks (2021-04) See merge request webgroup/fatcat!103
| * fix 'colected' typosBryan Newbold2021-04-132-2/+2
| | | | | | | | Thanks for the catch martin
| * prefer contrib.creator.display_name over contrib.raw_nameBryan Newbold2021-04-124-9/+17
| | | | | | | | | | | | | | | | These will be getting updates from ORCID and are usually more complete and more correct for display, attribution, and search purposes. Might need to tweak fuzzycat code to handle multiple names at the verification stage.
| * make dblp tests more robustBryan Newbold2021-04-121-2/+11
| | | | | | | | | | | | These were causing a lot of spurious errors in local development. Not sure these tweaks will entirely fix the problem.
| * web: show file size not known, when it isn'tBryan Newbold2021-04-121-0/+2
| | | | | | | | This is mostly to prevent showing an empty metadata box
| * web: better logic for showing 'save-paper-now' linkBryan Newbold2021-04-121-0/+2
| |
| * web: include DOI in share-your-paper URL, when possibleBryan Newbold2021-04-121-2/+8
| |
| * web: consistent public API URLsBryan Newbold2021-04-126-14/+9
| |
| * web: improve preservation holdings display for containersBryan Newbold2021-04-121-10/+22
| |
| * web: improve access button HTMLBryan Newbold2021-04-122-3/+2
| |
| * web: add goatcounter analyticsBryan Newbold2021-04-123-0/+16
|/ | | | Same setup as scholar.archive.org
* es worker: ensure kafka messages get clearedBryan Newbold2021-04-121-0/+2
|
* es indexing: more 'wip' fixesBryan Newbold2021-04-121-1/+5
|
* guide and openapi schema: fix QA URLs, and disclaim QA instanceBryan Newbold2021-04-124-10/+12
|
* ES indexing: skip 'wip' entities with a warningBryan Newbold2021-04-121-11/+16
|
* guide: push to both prod sitesBryan Newbold2021-04-121-0/+1
|
* update elasticsearch bootstrap indexing notesBryan Newbold2021-04-091-8/+16
|
* fatcat_ingest: fix recent lint failureBryan Newbold2021-04-091-1/+1
|
* search: more ES 7.x changes (track total counts)Bryan Newbold2021-04-092-0/+12
|
* CHANGELOG updates (partial; unreleased)Bryan Newbold2021-04-081-0/+21
|
* ES: rename fatcat_ref.json to ref_schema.json for consistency; add to READMEBryan Newbold2021-04-082-1/+4
|
* release ES schema: fix typo with shard/replica configurationBryan Newbold2021-04-081-1/+1
|
* sitemaps: filter to releases with PDF fulltext (for now)Bryan Newbold2021-04-071-0/+2
|
* Merge branch 'bnewbold-es-index-updates' into 'master'bnewbold2021-04-0814-27/+173
|\ | | | | | | | | fatcat elasticsearch schema updates See merge request webgroup/fatcat!101
| * container ES index worker: support for querying statusBryan Newbold2021-04-062-5/+37
| |
| * transform tool: container transform stats lookup supportBryan Newbold2021-04-062-2/+27
| |
| * ES schema updates: doc_index_ts as a str, not datetimeBryan Newbold2021-04-061-4/+4
| | | | | | | | | | The schema is a timestamp, but python needs to serialize as JSON, and doesn't do datetime automatically.
| * web infra: log to stderrBryan Newbold2021-04-061-2/+4
| |
| * search container stats: changes to be called from index code pathBryan Newbold2021-04-062-3/+20
| | | | | | | | Eg, allowing injection of more config values
| * container search schema: preservation stats, new fieldsBryan Newbold2021-04-063-15/+69
| | | | | | | | Includes transform code updates and partial test coverage.
| * release ES: add discipline fieldBryan Newbold2021-04-062-0/+3
| |
| * ES schemas: add doc_index_ts to all mappingsBryan Newbold2021-04-066-0/+13
| |
* | Merge branch 'bnewbold-es7' into 'master'bnewbold2021-04-0711-299/+294
|\| | | | | | | | | elasticsearch 7.x support See merge request webgroup/fatcat!100
| * web search: ES 6+7 compatibliityBryan Newbold2021-04-061-9/+21
| | | | | | | | Based on the similar changes made in fatcat-scholar
| * indexing: don't use document namesBryan Newbold2021-04-061-14/+4
| |
| * pipenv: switch to ES 7.x client librariesBryan Newbold2021-04-062-151/+245
| |
| * elasticsearch schema, docs, docker: update from ES 6.x to ES 7.xBryan Newbold2021-04-067-125/+24
|/ | | | | Including removing index document names (use '_doc' instead during transition)
* Merge branch 'martin-es-schema-citations' into 'master'bnewbold2021-04-021-0/+106
|\ | | | | | | | | add es draft schema for references See merge request webgroup/fatcat!99
| * add es draft schema for referencesMartin Czygan2021-03-301-0/+106
| |
* | Merge branch 'martin-datacite-release-contrib-err-sentry-77700' into 'master'bnewbold2021-04-023-4/+1
|\ \ | |/ |/| | | | | datacite: a missing surname should be None, not the empty string See merge request webgroup/fatcat!102