| Commit message | Author | Age | Files | Lines |
| |
Only Cargo.toml project metadata updated.
|
|\
| |
| |
| |
| | |
search: assume * when q is not set or empty
See merge request webgroup/fatcat!51
|
| |
| |
| |
| | |
An example would be a blank search from a container details page.
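A minimal sketch of the fallback described above. The function name `effective_query` is hypothetical, and treating a whitespace-only query the same as an empty one is an assumption, not something the commit states:

```python
from typing import Optional


def effective_query(q: Optional[str]) -> str:
    """Return the query string to pass on to the search backend.

    Hypothetical helper: a missing, empty, or whitespace-only query
    falls back to "*" (match everything).
    """
    if q is None or not q.strip():
        return "*"
    return q.strip()


print(effective_query(None))      # *
print(effective_query(""))        # *
print(effective_query("coffee"))  # coffee
```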
|
|\ \
| |/
|/|
| |
| | |
tweaks to search result pages
See merge request webgroup/fatcat!50
|
| | |
This is also back-ported from covid19.fatcat.wiki, though with some more
tweaks on top.
The changes are:
- show original title if available (usually non-English)
- move release_type label to title line suffix, and only show if not a
"paper"
- show publication status and withdrawal as text after the journal title,
not as a label
|
| |
| |
| |
| | |
These are back-ported fixes from covid19.fatcat.wiki
|
|\ \
| | |
| | |
| | |
| | |
| | |
| | | |
fix ident=None broken links
Closes #3
See merge request webgroup/fatcat!49
|
| |/
| |
| |
| |
| |
| | |
On web interface views for revisions, we had a bunch of broken links
because the ident is "None". This commit fixes these by removing the
links.
|
|\ \
| | |
| | |
| | |
| | | |
datacite: fix type error
See merge request webgroup/fatcat!48
|
|/ /
| |
| |
| |
| |
| |
| | |
Up to now, we expected the description to be a string or list. Add
handling for int as well.
First appeared: Apr 22 19:58:39.
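One way to handle all three shapes is to normalize the field to a list of strings first. This is only a sketch: `description_texts` is a hypothetical name, and dropping empty or unsupported values is an assumption rather than the importer's confirmed behavior:

```python
from typing import Any, List


def description_texts(value: Any) -> List[str]:
    """Normalize a description value (str, int, or list of those)
    to a list of non-empty strings. Hypothetical helper."""
    items = value if isinstance(value, list) else [value]
    texts = []
    for item in items:
        # bool is a subclass of int, so exclude it explicitly.
        if isinstance(item, (str, int)) and not isinstance(item, bool):
            text = str(item).strip()
            if text:
                texts.append(text)
    return texts


print(description_texts("A short abstract"))  # ['A short abstract']
print(description_texts(2020))                # ['2020']
print(description_texts(["first", 7, ""]))    # ['first', '7']
```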
|
|\ \
| | |
| | |
| | |
| | |
| | |
| | | |
into 'master'
datacite: fix a raw name constraint violation
See merge request webgroup/fatcat!47
|
| |/
| |
| |
| |
| |
| |
| | |
It was possible that contribs got added which had no raw name. One
example would be a name consisting of whitespace only.
This fix adds a final check for this case.
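A final check of this kind might look roughly like the following. The dict-based contrib shape, the `raw_name` key, and the function name are assumptions made for illustration:

```python
from typing import Dict, List, Optional

Contrib = Dict[str, Optional[str]]


def drop_nameless_contribs(contribs: List[Contrib]) -> List[Contrib]:
    """Keep only contribs whose raw name is non-empty after stripping.

    Hypothetical helper; the real importer may use entity objects
    rather than plain dicts.
    """
    kept = []
    for contrib in contribs:
        raw_name = (contrib.get("raw_name") or "").strip()
        if raw_name:
            kept.append({**contrib, "raw_name": raw_name})
    return kept


contribs = [
    {"raw_name": "Ada Lovelace", "role": "author"},
    {"raw_name": "   ", "role": "author"},
    {"raw_name": None, "role": "editor"},
]
print(drop_nameless_contribs(contribs))
# [{'raw_name': 'Ada Lovelace', 'role': 'author'}]
```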
|
|\ \
| |/
|/|
| |
| | |
fixes for changelog elasticsearch worker
See merge request webgroup/fatcat!46
|
| | |
|
|/
|
|
|
| |
The API fetch update may be needed for old changelog entries in the
kafka feed.
|
|\
| |
| |
| |
| | |
py37 cleanups
See merge request webgroup/fatcat!44
|
| | |
|
| | |
|
| | |
|
| |
| |
| |
| |
| | |
We had some pre-3.6 workarounds. Also seems like a reasonable time to
update all dependencies to most recent versions.
|
|\ \
| | |
| | |
| | |
| | | |
derive changelog worker from release worker
See merge request webgroup/fatcat!43
|
| | |
| | |
| | |
| | |
| | | |
Early versions of changelog entries may not have all the fields
required for the current transform.
|
|\ \ \
| | | |
| | | |
| | | |
| | | | |
changelog: extend release_types considered documents
See merge request webgroup/fatcat!42
|
| | | |
| | | |
| | | |
| | | |
| | | | |
Excluded: partial docs (e.g. abstract), overly generic components and
entries, and HTML blogs.
|
| | | | |
According to release_rev.release_type, we have 29 values:
fatcat_prod=# select release_type, count(release_type) from release_rev group by release_type;
   release_type    |   count
-------------------+-----------
 abstract          |      2264
 article           |   6371076
 article-journal   | 101083841
 article-newspaper |     17062
 book              |   1676941
 chapter           |  13914854
 component         |     58990
 dataset           |   6860325
 editorial         |    133573
 entry             |   1628487
 graphic           |   1809471
 interview         |     19898
 legal_case        |      3581
 legislation       |      1626
 letter            |    275119
 paper-conference  |   6074669
 peer_review       |     30581
 post              |    245807
 post-weblog       |       135
 report            |   1010699
 retraction        |      1292
 review-book       |     96219
 software          |       316
 song              |     24027
 speech            |      4263
 standard          |    312364
 stub              |   1036813
 thesis            |    414397
                   |         0
(29 rows)
|
| |_|/
|/| | |
|
| | | |
|
| | | |
|
| | | |
|
| | | |
|
| | | |
|
| | |
| | |
| | |
| | | |
Used to create bnewbold/fatcat-test-base image
|
| |/
|/|
| |
| | |
Goal is to speed up CI runs.
|
|/
|
|
| |
Not sure why things build without this.
|
|
|
|
|
|
| |
Required updating to newer 'buster' Debian distro, and a newer rust
release to work around a Docker/OCI containerization issue with older
docker images.
|
| |
|
|
|
|
| |
Also updates dependencies.
|
| |
|
|
|
|
|
| |
- don't do expanded and regular release dumps
- default to sqldump_public for item name (as that is common-case)
|
|\
| |
| |
| |
| | |
beautifulsoup XML parsing: .string vs. .get_text()
See merge request webgroup/fatcat!40
|
| |
| |
| |
| |
| |
| |
| | |
The primary motivation for this change is that fatcat *requires* a
non-empty title for each release entity. Pubmed/Medline occasionally
indexes just a VernacularTitle with no ArticleTitle for foreign
publications, and currently those records don't end up in fatcat at all.
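The fallback can be sketched as below. Note that this uses the standard library's `xml.etree.ElementTree` rather than the project's BeautifulSoup code, and that `element_text` and `pick_title` are hypothetical names; the sample titles are made up:

```python
import xml.etree.ElementTree as ET
from typing import Optional


def element_text(parent: ET.Element, tag: str) -> Optional[str]:
    """Return all text inside the first matching child, or None if the
    child is missing or contains only whitespace."""
    el = parent.find(tag)
    if el is None:
        return None
    text = "".join(el.itertext()).strip()
    return text or None


def pick_title(article: ET.Element) -> Optional[str]:
    """Prefer ArticleTitle; fall back to VernacularTitle."""
    return element_text(article, "ArticleTitle") or element_text(
        article, "VernacularTitle"
    )


article = ET.fromstring(
    "<Article><ArticleTitle/>"
    "<VernacularTitle>Zur Klinik der Grippe</VernacularTitle></Article>"
)
print(pick_title(article))  # Zur Klinik der Grippe
```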
|
| | |
|
| |
| |
| |
| | |
See previous pubmed commit for details.
|
| | |
Yikes! Apparently when a tag has child tags, .string will return None
instead of all the strings. .get_text() returns all of it:
https://www.crummy.com/software/BeautifulSoup/bs4/doc/#get-text
https://www.crummy.com/software/BeautifulSoup/bs4/doc/#string
I've left things like identifiers as .string, where we expect only a
single string inside.
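A small demonstration of the difference, assuming `bs4` is installed. The record below is made up, and it is parsed with the built-in `html.parser` (which lowercases tag names) instead of an XML parser, to avoid an extra dependency:

```python
from bs4 import BeautifulSoup

# Made-up record: the title mixes plain text with a child <i> tag.
doc = (
    "<record><pmid>12345</pmid>"
    "<articletitle>Growth of <i>E. coli</i> in broth</articletitle>"
    "</record>"
)
soup = BeautifulSoup(doc, "html.parser")

pmid = soup.find("pmid")
title = soup.find("articletitle")

# One string child: .string is safe.
print(pmid.string)       # 12345

# Text plus a child tag: .string gives up and returns None.
print(title.string)      # None

# .get_text() concatenates every string under the tag.
print(title.get_text())  # Growth of E. coli in broth
```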
|
|\ \
| | |
| | |
| | |
| | | |
proposal: fuzzy matching
See merge request webgroup/fatcat!39
|
|/ / |
|
| | |
|