fatcat
| Commit message | Author | Age | Files | Lines |
|---|---|---|---|---|
| gitlab CI: explicitly use xenial tag of image | Bryan Newbold | 2020-12-11 | 1 | -1/+1 |
| docker xenial base image: include python3.8 | Bryan Newbold | 2020-12-11 | 1 | -1/+6 |
| HACK: squash intermitent failure of detect_text_lang() test | Bryan Newbold | 2020-12-11 | 1 | -1/+2 |
| guide: small updates to container extra schema notes (from dblp work) | Bryan Newbold | 2020-12-11 | 1 | -2/+7 |
| bulk edits: note ORCID update | Bryan Newbold | 2020-12-11 | 1 | -1/+5 |
| docker: how to push to dockerhub | Bryan Newbold | 2020-12-11 | 1 | -0/+4 |
| Merge branch 'bnewbold-doaj-metadata' into 'master' | Martin Czygan | 2020-11-24 | 37 | -1549/+2845 |
| cargo: update sentry to fix memory initialization issue | Bryan Newbold | 2020-11-20 | 2 | -274/+332 |
| DOAJ: remove accidentally commited 'skip' of a test | Bryan Newbold | 2020-11-20 | 1 | -1/+0 |
| langdetect: more text for 'zh' test case | Bryan Newbold | 2020-11-20 | 1 | -1/+1 |
| DOAJ: update importer README with example invocation | Bryan Newbold | 2020-11-20 | 1 | -0/+7 |
| crossref+datacite: remove confusing early update bail | Bryan Newbold | 2020-11-20 | 2 | -4/+0 |
| doaj: fix update code path (getattr not __dict__) | Bryan Newbold | 2020-11-20 | 3 | -15/+70 |
| DOAJ: handle empty identifier 'id' case | Bryan Newbold | 2020-11-20 | 1 | -0/+2 |
| clean DOI: ban all non-ASCII characters | Bryan Newbold | 2020-11-19 | 1 | -1/+4 |
| normal: handle langdetect of 'zh-cn' (not len=2) | Bryan Newbold | 2020-11-19 | 1 | -0/+3 |
| update fatcatd rust code for 'oai' external identifier | Bryan Newbold | 2020-11-19 | 4 | -11/+189 |
| codegen rust schema crate | Bryan Newbold | 2020-11-19 | 6 | -3/+20 |
| codegen python openapi client | Bryan Newbold | 2020-11-19 | 2 | -4/+36 |
| schema: also add 'oai' identifer (OAI-PMH) for releases | Bryan Newbold | 2020-11-19 | 2 | -2/+9 |
| tweak DOAJ importer class args and default for do_updates | Bryan Newbold | 2020-11-19 | 1 | -2/+2 |
| show DOAJ (and dblp) identifiers in release view | Bryan Newbold | 2020-11-19 | 1 | -1/+7 |
| if a release has DOAJ article id, count as OA | Bryan Newbold | 2020-11-19 | 1 | -0/+3 |
| implement remainder of DOAJ article importer | Bryan Newbold | 2020-11-19 | 3 | -68/+168 |
| handle more non-ASCII DOI cases | Bryan Newbold | 2020-11-19 | 1 | -1/+3 |
| more python normalizers, and move from importer common | Bryan Newbold | 2020-11-19 | 2 | -154/+326 |
| initial implementation of DOAJ importer | Bryan Newbold | 2020-11-19 | 4 | -0/+387 |
| python API client: resolve warning about '\d' in string | Bryan Newbold | 2020-11-19 | 1 | -2/+2 |
| rustfmt | Bryan Newbold | 2020-11-19 | 5 | -87/+138 |
| rust: fatcatd changes for DOAJ+dblp identifiers | Bryan Newbold | 2020-11-19 | 6 | -949/+1062 |
| codegen rust crate for v0.3.3 | Bryan Newbold | 2020-11-19 | 8 | -227/+244 |
| codegen python client library for v0.3.3 | Bryan Newbold | 2020-11-19 | 7 | -16/+80 |
| schema: DOAJ+dblp ext_ids; bump to v0.3.3 | Bryan Newbold | 2020-11-19 | 2 | -1/+25 |
| ingest and proposal updates | Bryan Newbold | 2020-11-19 | 2 | -0/+45 |
| Merge branch 'bnewbold-xml-html-ingest' into 'master' | Martin Czygan | 2020-11-19 | 10 | -66/+409 |
| html ingest: actual xhtml mimetype | Bryan Newbold | 2020-11-16 | 1 | -2/+2 |
| ingest tool: support for setting ingest type | Bryan Newbold | 2020-11-06 | 2 | -6/+10 |
| html ingest: remaining implementation | Bryan Newbold | 2020-11-06 | 1 | -22/+19 |
| ingest: fix XML ingest test file | Bryan Newbold | 2020-11-05 | 1 | -1/+1 |
| ingest: progress on HTML ingest | Bryan Newbold | 2020-11-05 | 3 | -16/+74 |
| ingest: initial 'web' worker implementation | Bryan Newbold | 2020-11-05 | 3 | -67/+301 |
| refactor: white/black -> allow/block | Bryan Newbold | 2020-11-05 | 1 | -4/+4 |
| ingest: whitelist -> allowlist | Bryan Newbold | 2020-11-05 | 2 | -6/+6 |
| ingest: tests for basic XML ingest | Bryan Newbold | 2020-11-05 | 2 | -0/+18 |
| ingest: basic checks for ingest_type | Bryan Newbold | 2020-11-05 | 3 | -4/+36 |
| normalizer: filter out a specific non-ASCII character in DOI | Bryan Newbold | 2020-11-04 | 1 | -1/+3 |
| entity updates: don't ingest JSTOR DOI prefixes | Bryan Newbold | 2020-10-23 | 1 | -0/+2 |
| Merge branch 'bnewbold-scholar-pipeline' into 'master' | bnewbold | 2020-10-20 | 2 | -2/+26 |
| entity updater: new work update feed (ident and changelog metadata only) | Bryan Newbold | 2020-10-16 | 2 | -2/+26 |
| bulk citation graph workflow proposal | Bryan Newbold | 2020-10-15 | 1 | -0/+160 |