Commit message | Author | Age | Files | Lines | |
---|---|---|---|---|---|
* | more interesting example entities (eg, to crawl) | Bryan Newbold | 2021-05-18 | 1 | -0/+19 |
| | |||||
* | elasticsearch ref schema: 6 shards, not 12 | Bryan Newbold | 2021-05-18 | 1 | -1/+1 |
| | |||||
* | Merge branch 'bnewbold-pipenv-cleanup' into 'master' | bnewbold | 2021-04-23 | 2 | -327/+277 |
|\ | pipenv cleanup. See merge request webgroup/fatcat!104 | ||||
| * | pipenv: re-lock project | Bryan Newbold | 2021-04-19 | 1 | -301/+253 |
| | | |||||
| * | pipenv: constrain most package versions to at least major | Bryan Newbold | 2021-04-19 | 1 | -24/+24 |
| | | Don't have a complete policy with this change, just locking things down a bit more so small package additions and updates don't end up upgrading some small dependency to a major new backwards-incompatible version. Also, correct bs4 -> beautifulsoup4 (bs4 is the import name, not the package name). | ||||
| * | pipenv: remove unused pg-view and pykafka libraries | Bryan Newbold | 2021-04-19 | 1 | -2/+0 |
| | | |||||
* | | web: fix edit form style guide links | Bryan Newbold | 2021-04-20 | 2 | -4/+4 |
|/ | |||||
* | transforms: fix 'display_ame' typo | Bryan Newbold | 2021-04-19 | 1 | -2/+2 |
| | |||||
* | web: expand release creators in more situations | Bryan Newbold | 2021-04-19 | 2 | -2/+2 |
| | |||||
* | fix public API links | Bryan Newbold | 2021-04-15 | 1 | -2/+2 |
| | |||||
* | Merge branch 'bnewbold-ui-tweaks-202104' into 'master' | bnewbold | 2021-04-13 | 18 | -41/+90 |
|\ | Misc UI tweaks (2021-04). See merge request webgroup/fatcat!103 | ||||
| * | fix 'colected' typos | Bryan Newbold | 2021-04-13 | 2 | -2/+2 |
| | | Thanks for the catch martin | ||||
| * | prefer contrib.creator.display_name over contrib.raw_name | Bryan Newbold | 2021-04-12 | 4 | -9/+17 |
| | | These will be getting updates from ORCID and are usually more complete and more correct for display, attribution, and search purposes. Might need to tweak fuzzycat code to handle multiple names at the verification stage. | ||||
| * | make dblp tests more robust | Bryan Newbold | 2021-04-12 | 1 | -2/+11 |
| | | These were causing a lot of spurious errors in local development. Not sure these tweaks will entirely fix the problem. | ||||
| * | web: show file size not known, when it isn't | Bryan Newbold | 2021-04-12 | 1 | -0/+2 |
| | | This is mostly to prevent showing an empty metadata box | ||||
| * | web: better logic for showing 'save-paper-now' link | Bryan Newbold | 2021-04-12 | 1 | -0/+2 |
| | | |||||
| * | web: include DOI in share-your-paper URL, when possible | Bryan Newbold | 2021-04-12 | 1 | -2/+8 |
| | | |||||
| * | web: consistent public API URLs | Bryan Newbold | 2021-04-12 | 6 | -14/+9 |
| | | |||||
| * | web: improve preservation holdings display for containers | Bryan Newbold | 2021-04-12 | 1 | -10/+22 |
| | | |||||
| * | web: improve access button HTML | Bryan Newbold | 2021-04-12 | 2 | -3/+2 |
| | | |||||
| * | web: add goatcounter analytics | Bryan Newbold | 2021-04-12 | 3 | -0/+16 |
|/ | Same setup as scholar.archive.org | ||||
* | es worker: ensure kafka messages get cleared | Bryan Newbold | 2021-04-12 | 1 | -0/+2 |
| | |||||
* | es indexing: more 'wip' fixes | Bryan Newbold | 2021-04-12 | 1 | -1/+5 |
| | |||||
* | guide and openapi schema: fix QA URLs, and disclaim QA instance | Bryan Newbold | 2021-04-12 | 4 | -10/+12 |
| | |||||
* | ES indexing: skip 'wip' entities with a warning | Bryan Newbold | 2021-04-12 | 1 | -11/+16 |
| | |||||
* | guide: push to both prod sites | Bryan Newbold | 2021-04-12 | 1 | -0/+1 |
| | |||||
* | update elasticsearch bootstrap indexing notes | Bryan Newbold | 2021-04-09 | 1 | -8/+16 |
| | |||||
* | fatcat_ingest: fix recent lint failure | Bryan Newbold | 2021-04-09 | 1 | -1/+1 |
| | |||||
* | search: more ES 7.x changes (track total counts) | Bryan Newbold | 2021-04-09 | 2 | -0/+12 |
| | |||||
* | CHANGELOG updates (partial; unreleased) | Bryan Newbold | 2021-04-08 | 1 | -0/+21 |
| | |||||
* | ES: rename fatcat_ref.json to ref_schema.json for consistency; add to README | Bryan Newbold | 2021-04-08 | 2 | -1/+4 |
| | |||||
* | release ES schema: fix typo with shard/replica configuration | Bryan Newbold | 2021-04-08 | 1 | -1/+1 |
| | |||||
* | sitemaps: filter to releases with PDF fulltext (for now) | Bryan Newbold | 2021-04-07 | 1 | -0/+2 |
| | |||||
* | Merge branch 'bnewbold-es-index-updates' into 'master' | bnewbold | 2021-04-08 | 14 | -27/+173 |
|\ | fatcat elasticsearch schema updates. See merge request webgroup/fatcat!101 | ||||
| * | container ES index worker: support for querying status | Bryan Newbold | 2021-04-06 | 2 | -5/+37 |
| | | |||||
| * | transform tool: container transform stats lookup support | Bryan Newbold | 2021-04-06 | 2 | -2/+27 |
| | | |||||
| * | ES schema updates: doc_index_ts as a str, not datetime | Bryan Newbold | 2021-04-06 | 1 | -4/+4 |
| | | The schema is a timestamp, but python needs to serialize as JSON, and doesn't do datetime automatically. | ||||
| * | web infra: log to stderr | Bryan Newbold | 2021-04-06 | 1 | -2/+4 |
| | | |||||
| * | search container stats: changes to be called from index code path | Bryan Newbold | 2021-04-06 | 2 | -3/+20 |
| | | Eg, allowing injection of more config values | ||||
| * | container search schema: preservation stats, new fields | Bryan Newbold | 2021-04-06 | 3 | -15/+69 |
| | | Includes transform code updates and partial test coverage. | ||||
| * | release ES: add discipline field | Bryan Newbold | 2021-04-06 | 2 | -0/+3 |
| | | |||||
| * | ES schemas: add doc_index_ts to all mappings | Bryan Newbold | 2021-04-06 | 6 | -0/+13 |
| | | |||||
* | | Merge branch 'bnewbold-es7' into 'master' | bnewbold | 2021-04-07 | 11 | -299/+294 |
|\| | elasticsearch 7.x support. See merge request webgroup/fatcat!100 | ||||
| * | web search: ES 6+7 compatibility | Bryan Newbold | 2021-04-06 | 1 | -9/+21 |
| | | Based on the similar changes made in fatcat-scholar | ||||
| * | indexing: don't use document names | Bryan Newbold | 2021-04-06 | 1 | -14/+4 |
| | | |||||
| * | pipenv: switch to ES 7.x client libraries | Bryan Newbold | 2021-04-06 | 2 | -151/+245 |
| | | |||||
| * | elasticsearch schema, docs, docker: update from ES 6.x to ES 7.x | Bryan Newbold | 2021-04-06 | 7 | -125/+24 |
|/ | Including removing index document names (use '_doc' instead during transition) | ||||
* | Merge branch 'martin-es-schema-citations' into 'master' | bnewbold | 2021-04-02 | 1 | -0/+106 |
|\ | add es draft schema for references. See merge request webgroup/fatcat!99 | ||||
| * | add es draft schema for references | Martin Czygan | 2021-03-30 | 1 | -0/+106 |
| | | |||||
* | | Merge branch 'martin-datacite-release-contrib-err-sentry-77700' into 'master' | bnewbold | 2021-04-02 | 3 | -4/+1 |
|\ \ | |/ |/| | datacite: a missing surname should be None, not the empty string. See merge request webgroup/fatcat!102 |