summaryrefslogtreecommitdiffstats
path: root/python/fatcat_web
Commit message (Collapse)AuthorAgeFilesLines
* finish backend refactoring of search codeBryan Newbold2020-07-241-133/+108
|
* refactor release and container searchBryan Newbold2020-07-245-131/+233
| | | | | | | | | | Based on fatcat-scholar refactoring. This doesn't include refactoring of stats, aggregates, or histograms yet, just the direct queries. Don't have any test coverage yet; intend to try elasticmock or figuring out how to ingest mock JSON results directly.
* web search: fix pylint errorBryan Newbold2020-07-241-2/+2
|
* WIP: refactoring search to use elasticsearch-dslBryan Newbold2020-07-242-153/+137
|
* simple lint (flake8) fixes over python codebaseBryan Newbold2020-07-232-2/+2
| | | | | | These should not have any behavior changes, though a number of exception catches are now more general, and there may be long-tail exceptions getting thrown in these statements.
* get mediawiki username creation working with spacesBryan Newbold2020-07-091-1/+2
|
* lint (flake8) web interface python filesBryan Newbold2020-07-017-26/+16
|
* make fulltext-only label clickableMartin Czygan2020-06-161-2/+2
|
* use ES 'best_url' in file download pagesBryan Newbold2020-06-042-2/+4
| | | | Similar to recent change for release download pages.
* re-use 'best pdf url' for release green buttonBryan Newbold2020-06-041-2/+2
| | | | | | | I thought this was the existing behavior, but it looks like we were just taking the first link from the first file. In the future may refactor this out even further.
* Merge branch 'martin-fix-container-empty-search' into 'master'Martin Czygan2020-04-291-0/+4
|\ | | | | | | | | search: assume * when q is not set or empty See merge request webgroup/fatcat!51
| * search: assume * when q is not set or emptyMartin Czygan2020-04-291-0/+4
| | | | | | | | An example would be a blank search from a container details page.
* | Merge branch 'bnewbold-search-tweaks' into 'master'bnewbold2020-04-273-92/+132
|\ \ | |/ |/| | | | | tweaks to search result pages See merge request webgroup/fatcat!50
| * web search: tweak release search result styleBryan Newbold2020-04-231-25/+51
| | | | | | | | | | | | | | | | | | | | | | | | | | This is also back-ported from covid19.fatcat.wiki, though with some more tweaks on top. The changes are: - show original title if available (usually non-English) - move release_type label to title line suffix, and only show if not a "paper" - show publication status and withdrawl as text after the journal title, not as a label
| * web search: improve indentation, fix missing div tagsBryan Newbold2020-04-232-67/+81
| | | | | | | | These are back-ported fixes from covid19.fatcat.wiki
* | web: fix ident=None broken linksBryan Newbold2020-04-233-6/+10
|/ | | | | | On web interface views for revisions, we had a bunch of broken links because the ident is "None". This commit fixes these by removing the links.
* consistently use raw string prefix for regexBryan Newbold2020-04-171-1/+1
|
* web: small corrections to subtitle/original_nameBryan Newbold2020-03-282-2/+4
|
* Merge branch 'bnewbold-translation-view'Bryan Newbold2020-03-282-0/+7
|\
| * web: use more idiomatic type checkBryan Newbold2020-03-281-1/+1
| | | | | | | | Thanks to Martin for suggestion
| * web: display container original titlesBryan Newbold2020-03-261-0/+3
| |
| * web: copy extra.subtitle to subtitle for release viewBryan Newbold2020-03-261-0/+4
| | | | | | | | | | The release view will display subtitles, but it needs to be in the correct "location".
* | Merge pull request #53 from EdwardBetts/spellingbnewbold2020-03-273-3/+3
|\ \ | | | | | | Correct spelling mistakes
| * | Correct spelling mistakesEdward Betts2020-03-273-3/+3
| | |
* | | Merge branch 'bnewbold-400-bad-revisions' into 'master'Martin Czygan2020-03-261-2/+12
|\ \ \ | | | | | | | | | | | | | | | | catch ApiValueError in some generic API calls See merge request webgroup/fatcat!35
| * | | catch ApiValueError in some generic API callsBryan Newbold2020-03-251-2/+12
| | |/ | |/| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | The motivation for this change is to handle bogus revision IDs in URLs, which were causing 500 errors not 400 errors. Eg: https://qa.fatcat.wiki/file/rev/5d5d5162-b676-4f0a-968f-e19dadeaf96e%2B2019-11-27%2B13:49:51%2B0%2B6 I have no idea where these URLs are actually coming from, but they should be 4xx not 5xx. Investigating made me realize there is a whole category of ApiValueError exceptions we were not catching and should have been.
* / | improve citeproc/CSL web interfaceBryan Newbold2020-03-252-3/+7
|/ / | | | | | | | | | | | | | | | | | | | | | | | | | | This tries to show the citeproc (bibtext, MLA, CSL-JSON) options for more releases, and not show the links when they would break. The primary motivation here is to work around two exceptions being thrown in prod every day (according to sentry): KeyError: 'role' ValueError: CLS requries some surname (family name) I'm guessing these are mostly coming from crawlers following the citeproc links on release landing pages.
* | container lookup: link to issn portal searchMartin Czygan2020-03-181-4/+3
| | | | | | | | | | | | | | | | | | Example: https://fatcat.wiki/container/lookup?issnl=2007-1248 - the linked https://portal.issn.org/2007-1248 yields a "page not found", while search yields results: https://portal.issn.org/api/search?search[]=MUST=allissnbis=2007-1248
* | update front-page statsBryan Newbold2020-03-171-3/+3
|/
* Merge branch 'bnewbold-elastic-v03b'Bryan Newbold2020-02-261-2/+2
|\
| * new biblio-only general searchBryan Newbold2020-01-291-2/+2
| | | | | | | | The other fields are now "copy_to" the merged biblio field.
* | clarify shadow filter codeBryan Newbold2020-02-131-1/+1
| |
* | filter out shadows-only files in webfaceBryan Newbold2020-02-132-2/+7
|/
* increase kafka-pixy timeout to 25 secondsBryan Newbold2020-01-281-1/+1
|
* pagination: use correct endpoint in navigationMartin Czygan2020-01-182-6/+6
|
* datacite release links and metadata expansionBryan Newbold2019-12-202-9/+13
| | | | | | | Small ergonomic changes for datacite releases: - add a link to live/current datacite metadata (like we do for Crossref) - expand "extra" metadata fields under 'datacite' dict in metadata view
* spn: incluce link_source/link_source_id in ingest requestBryan Newbold2019-12-201-0/+2
|
* fix spn kafka topic env varBryan Newbold2019-12-131-1/+1
|
* update ingest request schemaBryan Newbold2019-12-132-8/+14
| | | | | This is mostly changing ingest_type from 'file' to 'pdf', and adding 'link_source'/'link_source_id', plus some small cleanups.
* initial 'Save Paper Now' web formBryan Newbold2019-12-126-2/+224
|
* fix delete release history viewBryan Newbold2019-12-091-1/+1
| | | | | | This was causing 5xx errors in production and qa. Eg, at: https://qa.fatcat.wiki/release/aaaaaaaaaaaaarceaaaaaaaaai/history
* add missing underline in deleted entity web viewBryan Newbold2019-12-091-1/+1
|
* improve previous commit (JATS abstract hack)Bryan Newbold2019-12-031-4/+6
|
* hack: remove enclosing JATS XML tags around abstractsBryan Newbold2019-12-031-1/+7
| | | | | | The more complete fix is to actually render the JATS to HTML and display that. This is just to fix a nit with the most common case of XML tags in abstracts.
* make file edit form hash values case insensitiveBryan Newbold2019-12-021-0/+3
| | | | | | | Test in previous commit. This fixes a user-reported 500 error when creating a file with SHA1/SHA256/MD5 hashes in upper-case.
* Merge branch 'martin-search-results-pagination' into 'master'Martin Czygan2019-11-155-18/+79
|\ | | | | | | | | Add basic pagination to search results See merge request webgroup/fatcat!4
| * gray out inactive navigation linksMartin Czygan2019-11-141-2/+2
| | | | | | | | | | | | | | | | As per [this issue](https://github.com/Semantic-Org/Semantic-UI/issues/1885#issuecomment-77619519), text colors are not supported in semantic ui. To not move text too much, gray out inactive links.
| * move pagination into macrosMartin Czygan2019-11-143-43/+51
| | | | | | | | | | | | | | | | | | Two new macros: * top_results(found) * bottom_results(found) wip: move pagination into macro
| * Add basic pagination to search resultsMartin Czygan2019-11-084-14/+67
| | | | | | | | | | | | | | | | | | | | | | | | The "deep paging problem" imposes some limit, which currently is a hardcoded default value, `deep_page_limit=2000` in `do_search`. Elasticsearch can be configured, too: > Note that from + size can not be more than the index.max_result_window index setting, which defaults to 10,000. -- https://www.elastic.co/guide/en/elasticsearch/reference/current/search-request-body.html#request-body-search-from-size
* | web: catch MacaroonInitExceptionBryan Newbold2019-11-121-0/+4
|/ | | | | Caught one of these in sentry. Probably due to a crawler? Or typing gibberish in the token form.