index
:
fatcat
bnewbold-doaj-article-harvest
bnewbold-elastic-extras
bnewbold-openapi-client-generator-v601
bnewbold-pythonclient-types
bnewbold-redoc
bnewbold-rust-gen-v5
bnewbold-sitemap
bnewbold-ubuntu-jammy
cockroach
confluent-kafka
master
preview
x-attic-auth-other-macaroon-lib
x-attic-camp
x-attic-changelog-export
x-attic-chocula
x-attic-cockroach
x-attic-golang
x-attic-more-importers
x-attic-preview
x-attic-python-rust-hacks
[no description]
about
summary
refs
log
tree
commit
diff
stats
log msg
author
committer
range
path:
root
/
python
Commit message (
Expand
)
Author
Age
Files
Lines
...
*
clean DOI: ban all non-ASCII characters
Bryan Newbold
2020-11-19
1
-1
/
+4
*
normal: handle langdetect of 'zh-cn' (not len=2)
Bryan Newbold
2020-11-19
1
-0
/
+3
*
tweak DOAJ importer class args and default for do_updates
Bryan Newbold
2020-11-19
1
-2
/
+2
*
show DOAJ (and dblp) identifiers in release view
Bryan Newbold
2020-11-19
1
-1
/
+7
*
if a release has DOAJ article id, count as OA
Bryan Newbold
2020-11-19
1
-0
/
+3
*
implement remainder of DOAJ article importer
Bryan Newbold
2020-11-19
3
-68
/
+168
*
handle more non-ASCII DOI cases
Bryan Newbold
2020-11-19
1
-1
/
+3
*
more python normalizers, and move from importer common
Bryan Newbold
2020-11-19
2
-154
/
+326
*
initial implementation of DOAJ importer
Bryan Newbold
2020-11-19
4
-0
/
+387
*
html ingest: actual xhtml mimetype
Bryan Newbold
2020-11-16
1
-2
/
+2
*
ingest tool: support for setting ingest type
Bryan Newbold
2020-11-06
2
-6
/
+10
*
html ingest: remaining implementation
Bryan Newbold
2020-11-06
1
-22
/
+19
*
ingest: fix XML ingest test file
Bryan Newbold
2020-11-05
1
-1
/
+1
*
ingest: progress on HTML ingest
Bryan Newbold
2020-11-05
3
-16
/
+74
*
ingest: initial 'web' worker implementation
Bryan Newbold
2020-11-05
3
-67
/
+301
*
refactor: white/black -> allow/block
Bryan Newbold
2020-11-05
1
-4
/
+4
*
ingest: whitelist -> allowlist
Bryan Newbold
2020-11-05
2
-6
/
+6
*
ingest: tests for basic XML ingest
Bryan Newbold
2020-11-05
2
-0
/
+18
*
ingest: basic checks for ingest_type
Bryan Newbold
2020-11-05
3
-4
/
+36
*
normalizer: filter out a specific non-ASCII character in DOI
Bryan Newbold
2020-11-04
1
-1
/
+3
*
entity updates: don't ingest JSTOR DOI prefixes
Bryan Newbold
2020-10-23
1
-0
/
+2
*
entity updater: new work update feed (ident and changelog metadata only)
Bryan Newbold
2020-10-16
2
-2
/
+26
*
container coverage: add keeper link and KBART holdings list
Bryan Newbold
2020-10-13
1
-0
/
+11
*
release view: remove abiguous OA status indicator
Bryan Newbold
2020-10-13
1
-4
/
+0
*
container view: fix non-OA empty box
Bryan Newbold
2020-10-13
1
-3
/
+3
*
coverage: show counts and fraction in tooltip of coverage bars
Bryan Newbold
2020-10-13
1
-5
/
+5
*
chocula importer: small tweaks to update behavior
Bryan Newbold
2020-10-08
1
-8
/
+6
*
elastic transform: more preservation keepers
Bryan Newbold
2020-10-08
1
-1
/
+2
*
Merge branch 'bnewbold-202009-polish' into 'master'
Martin Czygan
2020-09-29
10
-124
/
+159
|
\
|
*
coverage: handle the case of hits, but none with years
Bryan Newbold
2020-09-17
1
-4
/
+5
|
*
web: handle unknown CSL style as a cleaner 400 page
Bryan Newbold
2020-09-17
2
-1
/
+7
|
*
web: update sub-resource integrity and pre-loading
Bryan Newbold
2020-09-17
1
-0
/
+13
|
*
lint cleanups
Bryan Newbold
2020-09-17
2
-3
/
+0
|
*
web: route constraints on fcids and UUIDs
Bryan Newbold
2020-09-17
2
-101
/
+103
|
*
container view: only show OA indicator when known
Bryan Newbold
2020-09-17
1
-5
/
+1
|
*
web container view: hide preservation when no releases
Bryan Newbold
2020-09-17
1
-8
/
+6
|
*
web toml editing: remove sub-entities from TOML
Bryan Newbold
2020-09-17
1
-0
/
+4
|
*
coverage search: pretty display for ES query errors
Bryan Newbold
2020-09-17
2
-1
/
+19
|
*
coverage: clarify available/accessible terminology
Bryan Newbold
2020-09-17
1
-1
/
+1
*
|
update keepers links to keepers.issn.org
Bryan Newbold
2020-09-28
2
-8
/
+8
*
|
address spammy datacite titles
Martin Czygan
2020-09-23
2
-0
/
+25
|
/
*
homepage: small grammar tweaks (The/the)
Bryan Newbold
2020-09-11
1
-3
/
+3
*
ingest: default to crawl protocols.io DOIs
Bryan Newbold
2020-09-10
1
-0
/
+2
*
datacite: handle case of empty-string version
Bryan Newbold
2020-09-10
3
-2
/
+3
*
remove spurious print statement
Bryan Newbold
2020-09-03
1
-1
/
+0
*
generic file entity clean-ups as part of file_meta importer
Bryan Newbold
2020-09-02
3
-0
/
+149
*
Merge branch 'bnewbold-filemeta'
Bryan Newbold
2020-08-27
5
-0
/
+162
|
\
|
*
fix comment typo (thanks martin)
Bryan Newbold
2020-08-27
1
-1
/
+1
|
*
fixes and test coverage for file_meta importer
Bryan Newbold
2020-08-21
4
-6
/
+82
|
*
initial implementation of file_meta importer
Bryan Newbold
2020-08-21
3
-0
/
+86
[prev]
[next]