Commit message (Collapse) | Author | Age | Files | Lines | ||
---|---|---|---|---|---|---|
... | ||||||
| * | Add basic pagination to search results | Martin Czygan | 2019-11-08 | 4 | -14/+67 | |
| | | | | | | | | | | | | | | | | | | | | | | | | The "deep paging problem" imposes some limit, which currently is a hardcoded default value, `deep_page_limit=2000` in `do_search`. Elasticsearch can be configured, too: > Note that from + size can not be more than the index.max_result_window index setting, which defaults to 10,000. -- https://www.elastic.co/guide/en/elasticsearch/reference/current/search-request-body.html#request-body-search-from-size | |||||
* | | web: catch MacaroonInitException | Bryan Newbold | 2019-11-12 | 1 | -0/+4 | |
| | | | | | | | | | | Caught one of these in sentry. Probably due to a crawler? Or typing gibberish in the token form. | |||||
* | | design notes for a larger database | Bryan Newbold | 2019-11-12 | 1 | -0/+81 | |
| | | ||||||
* | | old proposals for 'next' schema update | Bryan Newbold | 2019-11-12 | 1 | -0/+38 | |
| | | ||||||
* | | crossref patch bulk import | Bryan Newbold | 2019-11-12 | 2 | -0/+63 | |
| | | ||||||
* | | Merge branch 'martin-python-readme-es-note' into 'master' | bnewbold | 2019-11-08 | 1 | -0/+5 | |
|\ \ | | | | | | | | | | | | | mention elasticsearch empty index setup See merge request webgroup/fatcat!3 | |||||
| * | | mention elasticsearch empty index setup | Martin Czygan | 2019-11-08 | 1 | -0/+5 | |
| |/ | | | | | | | | | | | When setting up with the defaults, all works fine, except that the web search will try to access a local elasticsearch. Mention in README, how to create empty indices. | |||||
* | | crossref: accurate blank title counts | Bryan Newbold | 2019-11-05 | 1 | -0/+1 | |
| | | ||||||
* | | fix crossref component test | Bryan Newbold | 2019-11-04 | 1 | -1/+1 | |
| | | ||||||
* | | TODO idea: 'first seen' | Bryan Newbold | 2019-11-04 | 1 | -0/+1 | |
| | | ||||||
* | | crossref: component type | Bryan Newbold | 2019-11-04 | 1 | -1/+3 | |
| | | ||||||
* | | add 'component' as a release_type | Bryan Newbold | 2019-11-04 | 2 | -0/+3 | |
| | | ||||||
* | | crossref: count why skip happened | Bryan Newbold | 2019-11-04 | 1 | -1/+7 | |
| | | | | | | | | | | | | Might skip based on release type (eg container, not a paper/release), or missing title, or other reasons. Over 7 million DOIs are getting skipped, curious why. | |||||
* | | crossref: don't skip on short/null subtitle | Bryan Newbold | 2019-11-04 | 1 | -1/+1 | |
|/ | | | | This was a bug. Should only set subtitle black, not skip the import. | |||||
* | note file fixup pushed in prod | Bryan Newbold | 2019-10-09 | 2 | -1/+64 | |
| | ||||||
* | move corpus changes to 'notes/bulk_edits' | Bryan Newbold | 2019-10-08 | 3 | -0/+285 | |
| | ||||||
* | commit file cleaner tests | Bryan Newbold | 2019-10-08 | 1 | -0/+58 | |
| | ||||||
* | file cleanup tweaks to actually run | Bryan Newbold | 2019-10-08 | 2 | -5/+4 | |
| | ||||||
* | refactor duplicated b32_hex function in importers | Bryan Newbold | 2019-10-08 | 3 | -21/+11 | |
| | ||||||
* | dict wrapper for entity_from_json() | Bryan Newbold | 2019-10-08 | 2 | -3/+7 | |
| | ||||||
* | new cleanup python tool/framework | Bryan Newbold | 2019-10-08 | 5 | -0/+300 | |
| | ||||||
* | CHANGELOG entry for previous commit | Bryan Newbold | 2019-10-03 | 1 | -0/+6 | |
| | ||||||
* | redirect direct entity underscore links | Bryan Newbold | 2019-10-03 | 2 | -0/+30 | |
| | ||||||
* | export raw affiliation strings for analysis | Bryan Newbold | 2019-10-03 | 1 | -0/+17 | |
| | ||||||
* | update rust README with fatcat_test db creation note | Bryan Newbold | 2019-10-03 | 1 | -1/+4 | |
| | ||||||
* | update rust README re: openssl | Bryan Newbold | 2019-10-01 | 1 | -17/+1 | |
| | | | | | | | | | I believe an openssl library is still required locally, but with the SSL/TLS removal it now doesn't matter if it is OpenSSL 1.0 or 1.1. This is only a temporary work-around. When we update rust code generation, we will need to revisit these changes. The current version of swagger-rs still depends on HTTPS and OpenSSL 1.0 (via dependencies). | |||||
* | entirely remove unused https flag to fatcatd | Bryan Newbold | 2019-09-29 | 1 | -15/+6 | |
| | ||||||
* | cargo update fatcat rust after openssl removal | Bryan Newbold | 2019-09-29 | 1 | -76/+32 | |
| | ||||||
* | remove 'client' and hyper-openssl options from fatcat-openapi rust crate | Bryan Newbold | 2019-09-29 | 1 | -3/+6 | |
| | ||||||
* | webface: extra <br> in container lookup links | Bryan Newbold | 2019-09-21 | 1 | -1/+1 | |
| | ||||||
* | remove duplicate style ref in container edit view | Bryan Newbold | 2019-09-20 | 1 | -5/+0 | |
| | ||||||
* | review/fix all confluent-kafka produce code | Bryan Newbold | 2019-09-20 | 6 | -27/+75 | |
| | ||||||
* | small fixes to confluent-kafka importers/workers | Bryan Newbold | 2019-09-20 | 8 | -26/+69 | |
| | | | | | | | | - decrease default changelog pipeline to 5.0sec - fix missing KafkaException harvester imports - more confluent-kafka tweaks - updates to kafka consumer configs - bump elastic updates consumergroup (again) | |||||
* | update Pipfile.lock after confluent-kafka rebase | Bryan Newbold | 2019-09-20 | 1 | -1/+33 | |
| | ||||||
* | convert pipeline workers from pykafka to confluent-kafka | Bryan Newbold | 2019-09-20 | 3 | -125/+230 | |
| | ||||||
* | small kafka tweaks for robustness | Bryan Newbold | 2019-09-20 | 2 | -0/+5 | |
| | ||||||
* | convert importers to confluent-kafka library | Bryan Newbold | 2019-09-20 | 2 | -21/+74 | |
| | ||||||
* | bump max message size to ~20 MBytes | Bryan Newbold | 2019-09-20 | 2 | -0/+2 | |
| | ||||||
* | fixes to confluent-kafka harvesters | Bryan Newbold | 2019-09-20 | 3 | -20/+21 | |
| | ||||||
* | docker-compose: kafka 2.0, and -dev topic names | Bryan Newbold | 2019-09-20 | 1 | -3/+2 | |
| | ||||||
* | first draft harvesters using confluent-kafka | Bryan Newbold | 2019-09-20 | 3 | -48/+104 | |
| | ||||||
* | make default kafka env 'dev', not 'qa' | Bryan Newbold | 2019-09-20 | 2 | -4/+4 | |
| | ||||||
* | add confluent-kafka library (to replace pykafka) | Bryan Newbold | 2019-09-20 | 1 | -0/+1 | |
| | ||||||
* | guide: remove Recurse mention from CoC | Bryan Newbold | 2019-09-20 | 1 | -3/+2 | |
| | ||||||
* | CHANGELOG note of python codegen change | Bryan Newbold | 2019-09-19 | 1 | -0/+4 | |
| | ||||||
* | fix another python codegen auth contamination bug | Bryan Newbold | 2019-09-18 | 2 | -5/+37 | |
| | | | | | Seems to be the classic one where a dict as a default arg gets mutated then reused across instances. Blech. | |||||
* | python codegen: don't clobber README every time | Bryan Newbold | 2019-09-18 | 2 | -1/+2 | |
| | ||||||
* | python codegen with new openapi-generator tool | Bryan Newbold | 2019-09-18 | 79 | -5689/+6824 | |
| | ||||||
* | update python codegen script with new generator | Bryan Newbold | 2019-09-18 | 1 | -58/+6 | |
| | ||||||
* | spec: invalid to have an exmple here | Bryan Newbold | 2019-09-18 | 1 | -1/+0 | |
| |