summaryrefslogtreecommitdiffstats
path: root/TODO
diff options
context:
space:
mode:
authorBryan Newbold <bnewbold@robocracy.org>2019-01-17 10:43:37 -0800
committerBryan Newbold <bnewbold@robocracy.org>2019-01-17 10:43:37 -0800
commit3ad702e413c084590505cff16f9532b236718049 (patch)
tree603a50d9187572e8bce2773cce6a8dc3c0c4ce33 /TODO
parent3b21f3003bb6601e80b158ae6577d5646cf754c6 (diff)
downloadfatcat-3ad702e413c084590505cff16f9532b236718049.tar.gz
fatcat-3ad702e413c084590505cff16f9532b236718049.zip
update TODO
Diffstat (limited to 'TODO')
-rw-r--r--TODO29
1 files changed, 13 insertions, 16 deletions
diff --git a/TODO b/TODO
index 6417668d..5075f10a 100644
--- a/TODO
+++ b/TODO
@@ -1,32 +1,24 @@
## In Progress
-- basic python tests for editgroup, annotation, submission changes
-- python tests for new autoaccept behavior
-- python tests for citation table storage efficiency changes
- => should there be a distinction between empty list and no references?
- yes, eg if expanded or not hidden
- => postgres manual checks that this is working
- => also benchmark (both speed and efficiency)
+- check that any needed/new indices are in place
+ => seems to at least superficially work
+- benchmark citation efficiency (in QA)
+
+- all query params need to be strings, and parse in rust :(
+ since=(datetime.datetime.utcnow() + datetime.timedelta(seconds=1)).isoformat()+"Z"
+- doc: python client API needs to have booleans set as, eg, 'true'/'false' (str) (!?!?)
+ "note that non-required or collection query parameters will ignore garbage values, rather than causing a 400 response"
## Next Up
- "don't clobber" mode/flag for crossref import (and others?)
-- update_file requires 'id'. should it be 'ident'?
- => something different about file vs. release
-- guide updates for auth
-- refactor webface views to use shared entity_view.html template
-- handle 'wip' status entities in web UI
- elastic inserter should handle deletions and redirects; if state isn't
active, delete the document
=> don't delete, just store state. but need to "blank" redirects and WIP so
they don't show up in results
=> refactor inserter to be a class (eg, for command line use)
=> end-to-end test of this behavior?
-- date handling is really pretty bad for releases; mangling those Jan1/Dec31
- => elastic schema should have a year field (integer)
-- document: elastic query date syntax is like: date:[2018-10-01 TO 2018-12-31]
-- elastic transform should only include authors, not editors (?)
- webcapture timestamp schema cleanup (both CDX and base)
=> dt.to_rfc3339_opts(SecondsFormat::Secs, true)
=> but this is mostly buried in serialization code?
@@ -43,6 +35,9 @@
- handle very large author/reference lists (instead of dropping)
=> https://api.crossref.org/v1/works/http://dx.doi.org/10.1007/978-3-319-46095-6_7
=> 7000+ authors (!)
+- guide updates for auth
+- refactor webface views to use shared entity_view.html template
+- handle 'wip' status entities in web UI
## Bugs (or at least need tests)
@@ -151,6 +146,7 @@ new importers:
## Schema / Entity Fields
+- elastic transform should only include authors, not editors (?)
- arxiv_id field (keep flip-flopping)
- original_title field (internationalization, "original language")
- `doi` field for containers (at least for "journal" type; maybe for "series"
@@ -162,6 +158,7 @@ new importers:
## Other / Backburner
+- document: elastic query date syntax is like: date:[2018-10-01 TO 2018-12-31]
- fileset/webcapture webface anything
- display abstracts better. no hashes or metadata; prefer plain or HTML,
convert JATS if necessary