From e7c72cdee09d42b7d8afd9e2a2ebb7e9feeed94d Mon Sep 17 00:00:00 2001
From: Sreenketh Madgula
Date: Thu, 28 Jan 2021 12:56:02 +0530
Subject: made README more readable; fixed some errors

---
 README.md | 28 +++++++++++++---------------
 1 file changed, 13 insertions(+), 15 deletions(-)

diff --git a/README.md b/README.md
index 548e200..681ece8 100644
--- a/README.md
+++ b/README.md
@@ -6,13 +6,13 @@
 `fatcat-scholar` / Internet Archive Scholar
 ===========================================
 
-This is source code for an experimental ("alpha") fulltext web search interface
+This is source code for an experimental ("alpha") full-text web search interface
 over the 25+ million open research papers in the [fatcat](https://fatcat.wiki)
 catalog. A demonstration (pre-production) interface is available at .
 
 All of the heavy lifting of harvesting, crawling, and metadata corrections are
-all handled by the fatcat service; this service is just a bare-bones, read-only
+handled by the fatcat service; this service is just a bare-bones, read-only
 search interface. Unlike the basic fatcat.wiki search, this index allows
 querying the full content of papers when available.
 
@@ -21,15 +21,15 @@ querying the full content of papers when available.
 
 This repository is fairly small and contains:
 
-- `fatcat_scholar/`: Python code for web servce and indexing pipeline
+- `fatcat_scholar/`: Python code for web serivce and indexing pipeline
 - `fatcat_scholar/templates/`: HTML template for web interface
 - `tests/`: Python test files
 - `proposals/`: design documentation and change proposals
 - `data/`: empty directory for indexing pipeline
 
 A data pipeline converts groups of one or more fatcat "release" entities
-(grouped under a single "work") into a single search index document.
-Elasticsearch is used as the fulltext search engine. A simple web interface
+(grouped under a single "work" entitiy) into a single search index document.
+Elasticsearch is used as the full-text search engine. A simple web interface
 parses search requests and formats Elasticsearch results with highlights and
 first-page thumbnails.
 
@@ -47,23 +47,21 @@ Working on the indexing pipeline effectively requires internal access to the
 Internet Archive cluster and services, though some contributions and bugfixes
 are probably possible without staff access.
 
-To install dependencies for the first time, then run the tests (to ensure
-everything is working):
+To install dependencies for the first time run:
+ `make dep`
+then run the tests (to ensure everything is working):
+ `make test`
 
- make dep
- make test
-
-If developing the web interface, you will almost certainly need an example
+While developing the web interface, you will almost certainly need an example
 database running locally. A docker-compose file in `extra/docker/` can be used
 to run Elasticsearch 7.x locally. The `make dev-index` command will reset the
 local index with the correct schema mapping, and index any intermediate files
 in the `./data/` directory. We don't have an out-of-the-box solution for non-IA
 staff at this step (yet).
 
-After making changes to any user interface strings, the interface translation
-file (".pot") needs to be updated with `make extract-i18n`. When these changes
-are merged to master, the Weblate translation system will be updated
-automatically.
+After making changes to any user interface strings, the interface translation file (".pot") needs to be updated with
+`make extract-i18n`
+When these changes are merged to master, the Weblate translation system will be updated automatically.
 
 This repository uses `black` for code formatting; please run `make fmt` and
 `make lint` for submitting a pull request.
-- 
cgit v1.2.3


From a092aeaac633e4dfc021cb730f6e7ace83fbb13d Mon Sep 17 00:00:00 2001
From: Sreenketh Madgula
Date: Sat, 30 Jan 2021 12:16:05 +0530
Subject: made required changes

---
 README.md | 8 +++++---
 1 file changed, 5 insertions(+), 3 deletions(-)

diff --git a/README.md b/README.md
index 681ece8..4e6a5d5 100644
--- a/README.md
+++ b/README.md
@@ -48,10 +48,14 @@ Internet Archive cluster and services, though some contributions and bugfixes
 are probably possible without staff access.
 
 To install dependencies for the first time run:
+
  `make dep`
+
 then run the tests (to ensure everything is working):
+
  `make test`
 
+
 While developing the web interface, you will almost certainly need an example
 database running locally. A docker-compose file in `extra/docker/` can be used
 to run Elasticsearch 7.x locally. The `make dev-index` command will reset the
@@ -59,9 +63,7 @@ local index with the correct schema mapping, and index any intermediate files
 in the `./data/` directory. We don't have an out-of-the-box solution for non-IA
 staff at this step (yet).
 
-After making changes to any user interface strings, the interface translation file (".pot") needs to be updated with
-`make extract-i18n`
-When these changes are merged to master, the Weblate translation system will be updated automatically.
+After making changes to any user interface strings, the interface translation file (".pot") needs to be updated with `make extract-i18n`. When these changes are merged to master, the Weblate translation system will be updated automatically.
 
 This repository uses `black` for code formatting; please run `make fmt` and
 `make lint` for submitting a pull request.
-- 
cgit v1.2.3


From 9edd9b1f7d0f78b792b1df0081b8030e9b995c90 Mon Sep 17 00:00:00 2001
From: Milo Ivir
Date: Fri, 29 Jan 2021 21:42:00 +0000
Subject: Translated using Weblate (Croatian)

Currently translated at 96.8% (154 of 159 strings)

Translation: Internet Archive/Archive Scholar (web interface)
Translate-URL: https://hosted.weblate.org/projects/internetarchive/fatcat-scholar/hr/
---
 .../translations/hr/LC_MESSAGES/messages.mo | Bin 20732 -> 20693 bytes
 .../translations/hr/LC_MESSAGES/messages.po | 11 +++++------
 2 files changed, 5 insertions(+), 6 deletions(-)

diff --git a/fatcat_scholar/translations/hr/LC_MESSAGES/messages.mo b/fatcat_scholar/translations/hr/LC_MESSAGES/messages.mo
index 497bd07..cd39da2 100644
Binary files a/fatcat_scholar/translations/hr/LC_MESSAGES/messages.mo and b/fatcat_scholar/translations/hr/LC_MESSAGES/messages.mo differ
diff --git a/fatcat_scholar/translations/hr/LC_MESSAGES/messages.po b/fatcat_scholar/translations/hr/LC_MESSAGES/messages.po
index 521d0ab..73ca1a1 100644
--- a/fatcat_scholar/translations/hr/LC_MESSAGES/messages.po
+++ b/fatcat_scholar/translations/hr/LC_MESSAGES/messages.po
@@ -9,7 +9,7 @@ msgstr ""
 "Project-Id-Version: PROJECT VERSION\n"
 "Report-Msgid-Bugs-To: bnewbold@archive.org\n"
 "POT-Creation-Date: 2021-01-25 22:58-0800\n"
-"PO-Revision-Date: 2021-01-29 15:32+0000\n"
+"PO-Revision-Date: 2021-01-29 23:34+0000\n"
 "Last-Translator: Milo Ivir \n"
 "Language-Team: Croatian \n"
@@ -753,7 +753,7 @@ msgstr "minijatura teksta"
 
 #: fatcat_scholar/templates/search_macros.html:111
 msgid "Publisher / doi.org"
-msgstr "Izdavač / doi.org"
+msgstr "Izdavač/doi.org"
 
 #: fatcat_scholar/templates/search_macros.html:144
 #, python-format
@@ -795,10 +795,9 @@ msgid ""
 "also visit the "
 "original URL."
 msgstr ""
-"Primjerak ovog rada bio je dostupan na javnom webu i sačuvana je na Wayback "
-"Machine stranicama. Datum snimke je od %(capture_year)s; također možeš "
-"posjetiti izvorni "
-"URL."
+"Primjerak ovog rada bio je dostupan na javnom webu i sačuvan je na Wayback "
+"Machine stranicama. Snimljeno %(capture_year)s. Izvorni URL."
 
 #: fatcat_scholar/templates/search_macros.html:290
 msgid ""
-- 
cgit v1.2.3


From bb93cebcf5f5ffa3af9ed8b4a49d352b5f4963f5 Mon Sep 17 00:00:00 2001
From: Milo Ivir
Date: Sat, 30 Jan 2021 13:55:35 +0000
Subject: Translated using Weblate (Croatian)

Currently translated at 97.4% (155 of 159 strings)

Translation: Internet Archive/Archive Scholar (web interface)
Translate-URL: https://hosted.weblate.org/projects/internetarchive/fatcat-scholar/hr/
---
 .../translations/hr/LC_MESSAGES/messages.mo | Bin 20693 -> 21074 bytes
 .../translations/hr/LC_MESSAGES/messages.po | 20 +++++++++++---------
 2 files changed, 11 insertions(+), 9 deletions(-)

diff --git a/fatcat_scholar/translations/hr/LC_MESSAGES/messages.mo b/fatcat_scholar/translations/hr/LC_MESSAGES/messages.mo
index cd39da2..c03bfaa 100644
Binary files a/fatcat_scholar/translations/hr/LC_MESSAGES/messages.mo and b/fatcat_scholar/translations/hr/LC_MESSAGES/messages.mo differ
diff --git a/fatcat_scholar/translations/hr/LC_MESSAGES/messages.po b/fatcat_scholar/translations/hr/LC_MESSAGES/messages.po
index 73ca1a1..715dd4c 100644
--- a/fatcat_scholar/translations/hr/LC_MESSAGES/messages.po
+++ b/fatcat_scholar/translations/hr/LC_MESSAGES/messages.po
@@ -9,7 +9,7 @@ msgstr ""
 "Project-Id-Version: PROJECT VERSION\n"
 "Report-Msgid-Bugs-To: bnewbold@archive.org\n"
 "POT-Creation-Date: 2021-01-25 22:58-0800\n"
-"PO-Revision-Date: 2021-01-29 23:34+0000\n"
+"PO-Revision-Date: 2021-01-31 14:41+0000\n"
 "Last-Translator: Milo Ivir \n"
 "Language-Team: Croatian \n"
@@ -400,8 +400,8 @@ msgid ""
 "All Internet Archive preservation copy links have the same style and icon. "
 "Content from the Wayback Machine looks like this."
 msgstr ""
-"Sve poveznice za Internet Archive primjerke imaju isti stil i ikonu. Sadržaj "
-"od Wayback Machine izgleda ovako."
+"Sve poveznice za sačuvane primjerke u arhivu Internet Archive imaju isti "
+"stil i ikonu. Sadržaj od Wayback Machine izgleda ovako."
 
 #: fatcat_scholar/templates/help.html:122
 msgid ""
@@ -416,9 +416,9 @@ msgid ""
 "stored in general Internet Archive digital collections (as opposed to the "
 "web archive)"
 msgstr ""
-"Neki sačuvani sadržaji, osobito stariji radovi u javnom vlasništvu, mogu "
-"biti spremljeni u digitalnim zbirkama od Internet Archive (za razliku od web-"
-"arhiva)"
+"Neki sačuvani sadržaji, osobito stariji radovi u javnom vlasništvu, su možda "
+"biti spremljeni u općim digitalnim zbirkama arhiva Internet Archive (za "
+"razliku od web-arhiva)"
 
 #: fatcat_scholar/templates/help.html:130
 msgid ""
@@ -621,7 +621,7 @@ msgid ""
 "other scholarly documents preserved in the Internet Archive."
 msgstr ""
 "Ovaj indeks pretraživanja teksta obuhvaća preko 25 milijuna istraživačkih "
-"članaka i drugih znanstvenih dokumenata sačuvanih u „Internet Archive”."
+"članaka i drugih znanstvenih dokumenata sačuvanih u arhivu Internet Archive."
 
 #: fatcat_scholar/templates/home.html:24
 msgid ""
@@ -777,7 +777,7 @@ msgstr "stranica"
 
 #: fatcat_scholar/templates/search_macros.html:236
 msgid "Preserved Fulltext"
-msgstr "Očuvan tekst"
+msgstr "Sačuvani tekst"
 
 #: fatcat_scholar/templates/search_macros.html:259
 msgid "Web Archive Capture"
@@ -803,7 +803,7 @@ msgstr ""
 msgid ""
 "The Internet Archive has a preservation copy of this work in our general "
 "collections."
-msgstr "Internet Archive ima primjerak ovog djela u našim općim zbirkama."
+msgstr "Internet Archive ima primjerak ovog rada u našim općim zbirkama."
 
 #: fatcat_scholar/templates/search_macros.html:292
 msgid ""
@@ -836,6 +836,8 @@ msgid ""
 "able to obtain a copy of this work to read from the publisher, platforms, or "
 "institutional libraries."
 msgstr ""
+"Još ne znamo za javno sačuvan primjerak ovog rada. Primjerak ovog rada se "
+"možda može dobiti od izdavača, platformi ili institucionalnih knjižnica."
 
 #: fatcat_scholar/templates/search_macros.html:327
 #, python-format
-- 
cgit v1.2.3


From 0104be429cc40054f02a1161babc5b9128e8253c Mon Sep 17 00:00:00 2001
From: Milo Ivir
Date: Sun, 31 Jan 2021 19:31:42 +0000
Subject: Translated using Weblate (Croatian)

Currently translated at 97.4% (155 of 159 strings)

Translation: Internet Archive/Archive Scholar (web interface)
Translate-URL: https://hosted.weblate.org/projects/internetarchive/fatcat-scholar/hr/
---
 .../translations/hr/LC_MESSAGES/messages.mo | Bin 21074 -> 21069 bytes
 .../translations/hr/LC_MESSAGES/messages.po | 8 ++++----
 2 files changed, 4 insertions(+), 4 deletions(-)

diff --git a/fatcat_scholar/translations/hr/LC_MESSAGES/messages.mo b/fatcat_scholar/translations/hr/LC_MESSAGES/messages.mo
index c03bfaa..dd6afbd 100644
Binary files a/fatcat_scholar/translations/hr/LC_MESSAGES/messages.mo and b/fatcat_scholar/translations/hr/LC_MESSAGES/messages.mo differ
diff --git a/fatcat_scholar/translations/hr/LC_MESSAGES/messages.po b/fatcat_scholar/translations/hr/LC_MESSAGES/messages.po
index 715dd4c..fdbfd58 100644
--- a/fatcat_scholar/translations/hr/LC_MESSAGES/messages.po
+++ b/fatcat_scholar/translations/hr/LC_MESSAGES/messages.po
@@ -9,7 +9,7 @@ msgstr ""
 "Project-Id-Version: PROJECT VERSION\n"
 "Report-Msgid-Bugs-To: bnewbold@archive.org\n"
 "POT-Creation-Date: 2021-01-25 22:58-0800\n"
-"PO-Revision-Date: 2021-01-31 14:41+0000\n"
+"PO-Revision-Date: 2021-02-01 19:41+0000\n"
 "Last-Translator: Milo Ivir \n"
 "Language-Team: Croatian \n"
@@ -416,9 +416,9 @@ msgid ""
 "stored in general Internet Archive digital collections (as opposed to the "
 "web archive)"
 msgstr ""
-"Neki sačuvani sadržaji, osobito stariji radovi u javnom vlasništvu, su možda "
-"biti spremljeni u općim digitalnim zbirkama arhiva Internet Archive (za "
-"razliku od web-arhiva)"
+"Neki sačuvani sadržaji, osobito stariji radovi u javnom vlasništvu, možda su "
+"spremljeni u općim digitalnim zbirkama arhiva Internet Archive (za razliku "
+"od web-arhiva)"
 
 #: fatcat_scholar/templates/help.html:130
 msgid ""
-- 
cgit v1.2.3


From 1ebe9cc8e4a6968a7987a5d825ee1efbb338bba4 Mon Sep 17 00:00:00 2001
From: Nikita Epifanov
Date: Wed, 3 Feb 2021 14:40:41 +0000
Subject: Translated using Weblate (Russian)

Currently translated at 49.6% (79 of 159 strings)

Translation: Internet Archive/Archive Scholar (web interface)
Translate-URL: https://hosted.weblate.org/projects/internetarchive/fatcat-scholar/ru/
---
 .../translations/ru/LC_MESSAGES/messages.mo | Bin 7061 -> 8393 bytes
 .../translations/ru/LC_MESSAGES/messages.po | 26 ++++++++++-----------
 2 files changed, 12 insertions(+), 14 deletions(-)

diff --git a/fatcat_scholar/translations/ru/LC_MESSAGES/messages.mo b/fatcat_scholar/translations/ru/LC_MESSAGES/messages.mo
index 6d03e78..8234434 100644
Binary files a/fatcat_scholar/translations/ru/LC_MESSAGES/messages.mo and b/fatcat_scholar/translations/ru/LC_MESSAGES/messages.mo differ
diff --git a/fatcat_scholar/translations/ru/LC_MESSAGES/messages.po b/fatcat_scholar/translations/ru/LC_MESSAGES/messages.po
index 7c9c23c..2dfb56e 100644
--- a/fatcat_scholar/translations/ru/LC_MESSAGES/messages.po
+++ b/fatcat_scholar/translations/ru/LC_MESSAGES/messages.po
@@ -4,13 +4,14 @@
 # FIRST AUTHOR , 2020.
 # Artem , 2020, 2021.
 # Bryan Newbold , 2020, 2021.
+# Nikita Epifanov , 2021.
 msgid ""
 msgstr ""
 "Project-Id-Version: PROJECT VERSION\n"
-"Report-Msgid-Bugs-To: EMAIL@ADDRESS\n"
+"Report-Msgid-Bugs-To: bnewbold@archive.org\n"
 "POT-Creation-Date: 2021-01-25 22:58-0800\n"
-"PO-Revision-Date: 2021-01-26 06:46+0000\n"
-"Last-Translator: Artem \n"
+"PO-Revision-Date: 2021-02-04 14:41+0000\n"
+"Last-Translator: Nikita Epifanov \n"
 "Language-Team: Russian \n"
 "Language: ru\n"
@@ -108,10 +109,8 @@ msgid "About %(website_name)s"
 msgstr "О %(website_name)s"
 
 #: fatcat_scholar/templates/about.html:8
-#, fuzzy
-#| msgid "Internet Archive Scholar"
 msgid "About Internet Archive Scholar"
-msgstr "Академия архива Интернета"
+msgstr "О Академии архива Интернета"
 
 #: fatcat_scholar/templates/base.html:12
 msgid "Internet Archive Scholar"
@@ -240,18 +239,17 @@ msgid "Try It"
 msgstr "Попробовать"
 
 #: fatcat_scholar/templates/help.html:22
-#, fuzzy
-#| msgid "User Guide"
 msgid "Scholar User Guide"
-msgstr "Руководство пользователя"
+msgstr "Научное руководство пользователя"
 
 #: fatcat_scholar/templates/help.html:23
-#, fuzzy, python-format
+#, python-format
 msgid "See also: About Scholarly Search"
-msgstr "См. также: О научном поиске "
+msgstr ""
+"См. также: \n"
+"О научном поиске"
 
 #: fatcat_scholar/templates/help.html:25
-#, fuzzy
 msgid ""
 "This service provides fulltext searching over research publications archived in "
@@ -260,7 +258,7 @@ msgid ""
 "government reports, and more."
 msgstr ""
 "Данный сервис обеспечивает полнотекстовый поиск по всем исследовательским "
+"text_search\">полнотекстовый поиск по всем исследовательским "
 "публикациям, заархивированным в различных коллекциях Архива Интернета. Он "
 "включает в себя материалы из области естественных наук, гуманитарных наук, "
 "биомедицины, искусства, истории, промышленных исследований, "
@@ -839,7 +837,7 @@ msgstr "Издатель / doi.org"
 #: fatcat_scholar/templates/search_macros.html:144
 #, python-format
 msgid "%(year)s"
-msgstr ""
+msgstr "%(year)s"
 
 #: fatcat_scholar/templates/search_macros.html:197
 msgid "unknown"
-- 
cgit v1.2.3


From 6e077c4f5677e8ed69f572729076a225a7cb6aa9 Mon Sep 17 00:00:00 2001
From: Milo Ivir
Date: Wed, 3 Feb 2021 13:28:55 +0000
Subject: Translated using Weblate (Croatian)

Currently translated at 100.0% (159 of 159 strings)

Translation: Internet Archive/Archive Scholar (web interface)
Translate-URL: https://hosted.weblate.org/projects/internetarchive/fatcat-scholar/hr/
---
 .../translations/hr/LC_MESSAGES/messages.mo | Bin 21069 -> 22590 bytes
 .../translations/hr/LC_MESSAGES/messages.po | 12 +++++++++++-
 2 files changed, 11 insertions(+), 1 deletion(-)

diff --git a/fatcat_scholar/translations/hr/LC_MESSAGES/messages.mo b/fatcat_scholar/translations/hr/LC_MESSAGES/messages.mo
index dd6afbd..69adeca 100644
Binary files a/fatcat_scholar/translations/hr/LC_MESSAGES/messages.mo and b/fatcat_scholar/translations/hr/LC_MESSAGES/messages.mo differ
diff --git a/fatcat_scholar/translations/hr/LC_MESSAGES/messages.po b/fatcat_scholar/translations/hr/LC_MESSAGES/messages.po
index fdbfd58..a4c9a12 100644
--- a/fatcat_scholar/translations/hr/LC_MESSAGES/messages.po
+++ b/fatcat_scholar/translations/hr/LC_MESSAGES/messages.po
@@ -9,7 +9,7 @@ msgstr ""
 "Project-Id-Version: PROJECT VERSION\n"
 "Report-Msgid-Bugs-To: bnewbold@archive.org\n"
 "POT-Creation-Date: 2021-01-25 22:58-0800\n"
-"PO-Revision-Date: 2021-02-01 19:41+0000\n"
+"PO-Revision-Date: 2021-02-04 14:41+0000\n"
 "Last-Translator: Milo Ivir \n"
 "Language-Team: Croatian \n"
@@ -409,6 +409,9 @@ msgid ""
 "other alternative version of the work, the access link has an indicator. You "
 "can get details and view all versions by clicking on the primary title link"
 msgstr ""
+"Ako se u sačuvanom primjerku radi o predizdanju, manuskriptu autora ili neke "
+"druge alternativne verzije rada, poveznica za pristup sadrži oznaku. Detalje "
+"i prikaz svih verzija možeš dobiti pritiskom poveznice za primarni naslov"
 
 #: fatcat_scholar/templates/help.html:126
 msgid ""
@@ -425,6 +428,8 @@ msgid ""
 "Digitized copies of works on microfilm may be linked to experimentally. "
 "Access may be limited to controlled lending"
 msgstr ""
+"Digitalizirani primjerci rada na mikrofilmu mogu se eksperimentalno "
+"povezati. Pristup se može ograničiti na kontroliranu posudbu"
 
 #: fatcat_scholar/templates/help.html:134
 msgid ""
@@ -432,6 +437,8 @@ msgid ""
 "record\" of a research publication, but content is not always accessible to "
 "the general public"
 msgstr ""
+"Odredišna stranica izdavača ovlašteni je izvor za „verziju zapisa” "
+"istraživačke publikacije, ali sadržaj nije uvijek dostupan široj javnosti"
 
 #: fatcat_scholar/templates/help.html:138
 msgid ""
@@ -545,6 +552,9 @@ msgid ""
 "\"persistent identifiers\" that uniquely identify the specific version of "
 "the work. These are usually hyperlinks."
 msgstr ""
+"Ispod rezultata pretraživanja i alternativnih popisa verzija nalaze se svi "
+"poznati „trajni identifikatori” koji jedinstveno identificiraju određenu "
+"verziju rada. Uobičajeno su to poveznice."
 
 #: fatcat_scholar/templates/help.html:210
 msgid ""
-- 
cgit v1.2.3