{ "cells": [ { "cell_type": "markdown", "metadata": {}, "source": [ "# Journal Names\n", "\n", "Questions in the context of fuzzy matching.\n", "\n", "* How many journal names appear more than once?\n", "* What is the average length of the duplicated names vs the unique names?\n", "\n", "\n", "Input file is a single larger JSON, mapping names to issns.\n", "```json\n", "{\n", " \"Acta Orientalia.\": [\n", " \"0001-6438\"\n", " ],\n", " \"Acta Orientalia (København)\": [\n", " \"0001-6438\"\n", " ],\n", "..\n", "```" ] }, { "cell_type": "code", "execution_count": 1, "metadata": {}, "outputs": [], "source": [ "import json\n", "import pandas as pd" ] }, { "cell_type": "code", "execution_count": 2, "metadata": {}, "outputs": [], "source": [ "with open(\"../data/name_to_issn.json\") as f:\n", " mapping = json.load(f)" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "We have about 3M keys." ] }, { "cell_type": "code", "execution_count": 3, "metadata": {}, "outputs": [ { "data": { "text/plain": [ "2929727" ] }, "execution_count": 3, "metadata": {}, "output_type": "execute_result" } ], "source": [ "len(mapping)" ] }, { "cell_type": "code", "execution_count": 4, "metadata": {}, "outputs": [], "source": [ "df = pd.DataFrame(((k, len(v)) for k, v in mapping.items()), columns=[\"name\", \"issn_count\"])" ] }, { "cell_type": "code", "execution_count": 5, "metadata": {}, "outputs": [ { "data": { "text/plain": [ "2929727" ] }, "execution_count": 5, "metadata": {}, "output_type": "execute_result" } ], "source": [ "len(df)" ] }, { "cell_type": "code", "execution_count": 6, "metadata": {}, "outputs": [ { "data": { "text/html": [ "
\n", "\n", "\n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", "
nameissn_count
0Acta Orientalia.1
1Acta Orientalia (København)1
2˜The œpublishers weekly.1
3Publishers weekly1
4ASMT news1
\n", "
" ], "text/plain": [ " name issn_count\n", "0 Acta Orientalia. 1\n", "1 Acta Orientalia (København) 1\n", "2 ˜The œpublishers weekly. 1\n", "3 Publishers weekly 1\n", "4 ASMT news 1" ] }, "execution_count": 6, "metadata": {}, "output_type": "execute_result" } ], "source": [ "df.head()" ] }, { "cell_type": "code", "execution_count": 7, "metadata": {}, "outputs": [], "source": [ "unique_name = df[df.issn_count == 1]" ] }, { "cell_type": "code", "execution_count": 8, "metadata": {}, "outputs": [], "source": [ "repeated_names = df[df.issn_count > 1]" ] }, { "cell_type": "code", "execution_count": 9, "metadata": {}, "outputs": [ { "data": { "text/plain": [ "194241" ] }, "execution_count": 9, "metadata": {}, "output_type": "execute_result" } ], "source": [ "len(repeated_names)" ] }, { "cell_type": "code", "execution_count": 10, "metadata": {}, "outputs": [ { "data": { "text/plain": [ "0.06630003409874026" ] }, "execution_count": 10, "metadata": {}, "output_type": "execute_result" } ], "source": [ "len(repeated_names) / len(df)" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "About 6% (or 194241) names are repeated. " ] }, { "cell_type": "code", "execution_count": 11, "metadata": {}, "outputs": [ { "data": { "text/html": [ "
\n", "\n", "\n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", "
issn_count
count194241.000000
mean3.197523
std25.081605
min2.000000
25%2.000000
50%2.000000
75%2.000000
max8980.000000
\n", "
" ], "text/plain": [ " issn_count\n", "count 194241.000000\n", "mean 3.197523\n", "std 25.081605\n", "min 2.000000\n", "25% 2.000000\n", "50% 2.000000\n", "75% 2.000000\n", "max 8980.000000" ] }, "execution_count": 11, "metadata": {}, "output_type": "execute_result" } ], "source": [ "repeated_names.describe()" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "Which name is shared by over 8000 ISSN?" ] }, { "cell_type": "code", "execution_count": 12, "metadata": {}, "outputs": [ { "data": { "text/plain": [ "name Annual report.\n", "issn_count 8980\n", "Name: 45907, dtype: object" ] }, "execution_count": 12, "metadata": {}, "output_type": "execute_result" } ], "source": [ "repeated_names.iloc[repeated_names.issn_count.argmax()] # Annual report." ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "It is the \"Annual report.\"" ] }, { "cell_type": "code", "execution_count": 13, "metadata": {}, "outputs": [ { "data": { "text/plain": [ "['0706-537X',\n", " '1186-7957',\n", " '2324-1926',\n", " '1445-9248',\n", " '0872-3982',\n", " '1714-1524',\n", " '1037-8812',\n", " '0225-0241',\n", " '1327-6344',\n", " '0702-7702']" ] }, "execution_count": 13, "metadata": {}, "output_type": "execute_result" } ], "source": [ "mapping[\"Annual report.\"][:10]" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "On average a repeated name will point to 3 ISSN. About 24k names point to more than 3 ISSN." ] }, { "cell_type": "code", "execution_count": 14, "metadata": {}, "outputs": [ { "data": { "text/plain": [ "24107" ] }, "execution_count": 14, "metadata": {}, "output_type": "execute_result" } ], "source": [ "len(repeated_names[repeated_names.issn_count > 3])" ] }, { "cell_type": "code", "execution_count": 15, "metadata": {}, "outputs": [ { "data": { "text/html": [ "
\n", "\n", "\n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", "
nameissn_count
395908Criminal trial manual.4
336247Mesures.4
2242923Tinere speranţe.4
380962Résistances.5
823415˜Le œPaysan.14
157897Barron's regents exams and answers.18
468031Revista científica.14
1102294Kotoba to bunka.4
2049420˜L'œHarmonie.7
10843Design.14
\n", "
" ], "text/plain": [ " name issn_count\n", "395908 Criminal trial manual. 4\n", "336247 Mesures. 4\n", "2242923 Tinere speranţe. 4\n", "380962 Résistances. 5\n", "823415 ˜Le œPaysan. 14\n", "157897 Barron's regents exams and answers. 18\n", "468031 Revista científica. 14\n", "1102294 Kotoba to bunka. 4\n", "2049420 ˜L'œHarmonie. 7\n", "10843 Design. 14" ] }, "execution_count": 15, "metadata": {}, "output_type": "execute_result" } ], "source": [ "repeated_names[repeated_names.issn_count > 3].sample(n=10)" ] }, { "cell_type": "code", "execution_count": 16, "metadata": {}, "outputs": [ { "data": { "text/plain": [ "['1285-9133',\n", " '1480-4670',\n", " '1487-5349',\n", " '1724-6598',\n", " '2183-0134',\n", " '2538-693X',\n", " '2610-8933',\n", " '2035-8326',\n", " '2295-9084',\n", " '1517-8889',\n", " '2249-5053',\n", " '2420-9198',\n", " '2654-9263',\n", " '2610-8925',\n", " '1158-9574',\n", " '0872-4784',\n", " '0379-8402']" ] }, "execution_count": 16, "metadata": {}, "output_type": "execute_result" } ], "source": [ "mapping[\"Philosophica.\"]" ] }, { "cell_type": "code", "execution_count": 17, "metadata": {}, "outputs": [ { "data": { "text/plain": [ "" ] }, "execution_count": 17, "metadata": {}, "output_type": "execute_result" }, { "data": { "image/png": "iVBORw0KGgoAAAANSUhEUgAAAYMAAAD6CAYAAABDPiuvAAAAOXRFWHRTb2Z0d2FyZQBNYXRwbG90bGliIHZlcnNpb24zLjMuMCwgaHR0cHM6Ly9tYXRwbG90bGliLm9yZy86wFpkAAAACXBIWXMAAAsTAAALEwEAmpwYAAASOUlEQVR4nO3db4xd9X3n8fdncaCUpMGEduRga+2obiVvoxI6Il4lWk1D1xiyWlMpimBRcf5sXTUgJS3S1mkfkA2JlFQluyXqErnFG7OicSihtZU663VZrqo8gAAJxfwJ8RScYstAGxPIJFKyTr99cH8D93pnmPEde2Zy7/slXc053/M7557zzTGfOeeeO0lVIUkabf9qqXdAkrT0DANJkmEgSTIMJEkYBpIkDANJEvMIgyRrktyX5Ikkjyf5cKt/LMnRJI+015U963w0yWSSp5Jc3lPf3GqTSbb31NcleaDVv5jk7NN9oJKk2WWu7xkkWQWsqqqvJ3kD8DBwFfBeYKqq/uik8RuALwCXAm8G/gb4hbb4W8C/B44ADwLXVNUTSe4C7qmq3Uk+B/xdVd32Wvt14YUX1tq1a0/lWF/x/e9/n/POO2+gdYeR/XiVvehnP/oNQz8efvjhf6qqnz25vmKuFavqGHCsTX8vyZPARa+xyhZgd1X9EHgmySTdYACYrKqnAZLsBra07b0L+E9tzC7gY8BrhsHatWt56KGH5tr9GXU6HSYmJgZadxjZj1fZi372o98w9CPJt2eqn9JnBknWAm8DHmilG5I8mmRnkpWtdhHwbM9qR1pttvqbgO9W1YmT6pKkRTLnlcG0JK8HvgR8pKpeTnIbcDNQ7ectwAfOyF6+ug/bgG0AY2NjdDqdgbYzNTU18LrDyH68yl70sx/9hrkf8wqDJK+jGwR3VtU9AFX1fM/yPwW+3GaPAmt6Vl/dasxS/w5wfpIV7eqgd3yfqtoB7AAYHx+vQS/XhuFS73SyH6+yF/3sR79h7sd8niYKcDvwZFV9pqe+qmfYrwOPtem9wNVJzkmyDlgPfI3uB8br25NDZwNXA3ur+wn2fcB72vpbgT0LOyxJ0qmYz5XBO4DfAA4meaTVfh+4JsnFdG8THQZ+C6CqHm9PBz0BnACur6ofAyS5AdgPnAXsrKrH2/Z+D9id5BPAN+iGjyRpkcznaaKvAplh0b7XWOeTwCdnqO+bab32hNGlJ9clSYvDbyBLkgwDSZJhIEniFL5nMEwOHn2J923/64HWPfypd5/mvZGkpeeVgSTJMJAkGQaSJAwDSRKGgSQJw0CShGEgScIwkCRhGEiSMAwkSRgGkiQMA0kShoEkCcNAkoRhIEnCMJAkYRhIkjAMJEkYBpIkDANJEoaBJAnDQJKEYSBJwjCQJGEYSJIwDCRJGAaSJAwDSRKGgSQJw0CSxDzCIMmaJPcleSLJ40k+3OoXJDmQ5FD7ubLVk+TWJJNJHk1ySc+2trbxh5Js7an/SpKDbZ1bk+RMHKwkaWbzuTI4AdxYVRuAjcD1STYA24F7q2o9cG+bB7gCWN9e24DboBsewE3A24FLgZumA6SN+c2e9TYv/NAkSfM1ZxhU1bGq+nqb/h7wJHARsAXY1YbtAq5q01uAO6rrfuD8JKuAy4EDVXW8ql4EDgCb27Kfqar7q6qAO3q2JUlaBKf0mUGStcDbgAeAsao61hY9B4y16YuAZ3tWO9Jqr1U/MkNdkrRIVsx3YJLXA18CPlJVL/fe1q+qSlJnYP9O3odtdG89MTY2RqfTGWg7Y+fCjW89MdC6g77ncjY1NTWUxzUIe9HPfvQb5n7MKwySvI5uENxZVfe08vNJVlXVsXar54VWPwqs6Vl9dasdBSZOqndaffUM4/8/VbUD2AEwPj5eExMTMw2b02fv3MMtB+edg30OXzvYey5nnU6HQXs5bOxFP/vRb5j7MZ+niQLcDjxZVZ/pWbQXmH4iaCuwp6d+XXuqaCPwUrudtB/YlGRl++B4E7C/LXs5ycb2Xtf1bEuStAjm8+vxO4DfAA4meaTVfh/4FHBXkg8C3wbe25btA64EJoEfAO8HqKrjSW4GHmzjPl5Vx9v0h4DPA+cCX2kvSdIimTMMquqrwGzP/V82w/gCrp9lWzuBnTPUHwJ+aa59kSSdGX4DWZJkGEiSDANJEoaBJAnDQJKEYSBJwjCQJGEYSJIwDCRJGAaSJAwDSRKGgSQJw0CShGEgScIwkCRhGEiSMAwkSRgGkiQMA0kShoEkCcNAkoRhIEnCMJAkYRhIkjAMJEkYBpIkDANJEoaBJAnDQJKEYSBJwjCQJGEYSJIwDCRJGAaSJOYRBkl2JnkhyWM9tY8lOZrkkfa6smfZR5NMJnkqyeU99c2tNplke099XZIHWv2LSc4+nQcoSZrbfK4MPg9snqH+36rq4vbaB5BkA3A18G/aOv8jyVlJzgL+BLgC2ABc08YCfLpt6+eBF4EPLuSAJEmnbs4wqKq/BY7Pc3tbgN1V9cOqegaYBC5tr8mqerqqfgTsBrYkCfAu4O62/i7gqlM7BEnSQi3kM4MbkjzabiOtbLWLgGd7xhxptdnqbwK+W1UnTqpLkhbRigHXuw24Gaj28xbgA6drp2aTZBuwDWBsbIxOpzPQdsbOhRvfemLugTMY9D2Xs6mpqaE8rkHYi372o98w92OgMKiq56enk/wp8OU2exRY0zN0dasxS/07wPlJVrSrg97xM73vDmAHwPj4eE1MTAyy+3z2zj3ccnCwHDx87WDvuZx1Oh0G7eWwsRf97Ee/Ye7HQLeJkqzqmf11YPpJo73A1UnOSbIOWA98DXgQWN+eHDqb7ofMe6uqgPuA97T1twJ7BtknSdLg5vz1OMkXgAngwiRHgJuAiSQX071NdBj4LYCqejzJXcATwAng+qr6cdvODcB+4CxgZ1U93t7i94DdST4BfAO4/XQdnCRpfuYMg6q6ZobyrP/BrqpPAp+cob4P2DdD/Wm6TxtJkpaI30CWJBkGkiTDQJKEYSBJwjCQJGEYSJIwDCRJGAaSJAwDSRKGgSQJw0CShGEgScIwkCRhGEiSMAwkSRgGkiQMA0kShoEkCcNAkoRhIEnCMJAkYRhIkjAMJEkYBpIkDANJEoaBJAnDQJKEYSBJwjCQJGEYSJIwDCRJGAaSJAwDSRKGgSSJeYRBkp1JXkjyWE/tgiQHkhxqP1e2epLcmmQyyaNJLulZZ2sbfyjJ1p76ryQ52Na5NUlO90FKkl7bfK4MPg9sPqm2Hbi3qtYD97Z5gCuA9e21DbgNuuEB3AS8HbgUuGk6QNqY3+xZ7+T3kiSdYXOGQVX9LXD8pPIWYFeb3gVc1VO/o7ruB85Psgq4HDhQVcer6kXgALC5LfuZqrq/qgq4o2dbkqRFMuhnBmNVdaxNPweMtemLgGd7xh1ptdeqH5mhLklaRCsWuoGqqiR1OnZmLkm20b39xNjYGJ1OZ6DtjJ0LN771xEDrDvqey9nU1NRQHtcg7EU/+9FvmPsxaBg8n2RVVR1rt3peaPWjwJqecatb7SgwcVK90+qrZxg/o6raAewAGB8fr4mJidmGvqbP3rmHWw4OduiHrx3sPZezTqfDoL0cNvain/3oN8z9GPQ20V5g+omgrcCenvp17amijcBL7XbSfmBTkpXtg+NNwP627OUkG9tTRNf1bEuStEjm/PU4yRfo/lZ/YZIjdJ8K+hRwV5IPAt8G3tuG7wOuBCaBHwDvB6iq40luBh5s4z5eVdMfSn+I7hNL5wJfaS9J0iKaMwyq6ppZFl02w9gCrp9lOzuBnTPUHwJ+aa79kCSdOX4DWZJkGEiSDANJEoaBJAnDQJKEYSBJwjCQJGEYSJIwDCRJGAaSJAwDSRKGgSQJw0CShGEgScIwkCRhGEiSMAwkSRgGkiQMA0kShoEkCcNAkoRhIEnCMJAkYRhIkjAMJEkYBpIkDANJEoaBJAnDQJKEYSBJwjCQJGEYSJIwDCRJGAaSJBYYBkkOJzmY5JEkD7XaBUkOJDnUfq5s9SS5NclkkkeTXNKzna1t/KEkWxd2SJKkU3U6rgx+taourqrxNr8duLeq1gP3tnmAK4D17bUNuA264QHcBLwduBS4aTpAJEmL40zcJtoC7GrTu4Creup3VNf9wPlJVgGXAweq6nhVvQgcADafgf2SJM1ioWFQwP9J8nCSba02VlXH2vRzwFibvgh4tmfdI602W12StEhWLHD9d1bV0SQ/BxxI8s3ehVVVSWqB7/GKFjjbAMbGxuh0OgNtZ+xcuPGtJwZad9D3XM6mpqaG8rgGYS/62Y9+w9yPBYVBVR1tP19I8pd07/k/n2RVVR1rt4FeaMOPAmt6Vl/dakeBiZPqnVnebwewA2B8fLwmJiZmGjanz965h1sODnboh68d7D2Xs06nw6C9HDb2op/96DfM/Rj4NlGS85K8YXoa2AQ8BuwFpp8I2grsadN7gevaU0UbgZfa7aT9wKYkK9sHx5taTZK0SBZyZTAG/GWS6e38eVX97yQPAncl+SDwbeC9bfw+4EpgEvgB8H6Aqjqe5GbgwTbu41V1fAH7JUk6RQOHQVU9DfzyDPXvAJfNUC/g+lm2tRPYOei+SJIWxm8gS5IMA0mSYSBJwjCQJGEYSJIwDCRJGAaSJAwDSRKGgSQJw0CShGEgScIwkCRhGEiSMAwkSRgGkiQMA0kShoEkCcNAkoRhIEnCMJAkYRhIkjAMJEkYBpIkDANJEoaBJAnDQJKEYSBJwjCQJGEYSJIwDCRJGAaSJAwDSRKGgSQJw0CSxDIKgySbkzyVZDLJ9qXeH0kaJcsiDJKcBfwJcAWwAbgmyYal3StJGh3LIgyAS4HJqnq6qn4E7Aa2LPE+SdLIWC5hcBHwbM/8kVaTJC2CFUu9A6ciyTZgW5udSvLUgJu6EPingfbh0wO+4/I2cD+GkL3oZz/6DUM//vVMxeUSBkeBNT3zq1utT1XtAHYs9M2SPFRV4wvdzrCwH6+yF/3sR79h7sdyuU30ILA+ybokZwNXA3uXeJ8kaWQsiyuDqjqR5AZgP3AWsLOqHl/i3ZKkkbEswgCgqvYB+xbp7RZ8q2nI2I9X2Yt+9qPf0PYjVbXU+yBJWmLL5TMDSdISGqkwGJU/eZFkTZL7kjyR5PEkH271C5IcSHKo/VzZ6klya+vLo0ku6dnW1jb+UJKtS3VMC5XkrCTfSPLlNr8uyQPtmL/YHlwgyTltfrItX9uzjY+2+lNJLl+iQ1mwJOcnuTvJN5M8meTfjvi58Tvt38ljSb6Q5KdG8vyoqpF40f1g+u+BtwBnA38HbFjq/TpDx7oKuKRNvwH4Ft0/8/GHwPZW3w58uk1fCXwFCLAReKDVLwCebj9XtumVS318A/bkd4E/B77c5u8Crm7TnwN+u01/CPhcm74a+GKb3tDOmXOAde1cOmupj2vAXuwC/nObPhs4f1TPDbpfbn0GOLfnvHjfKJ4fo3RlMDJ/8qKqjlXV19v094An6Z70W+j+h4D286o2vQW4o7ruB85Psgq4HDhQVcer6kXgALB58Y7k9EiyGng38GdtPsC7gLvbkJN7Md2ju4HL2vgtwO6q+mFVPQNM0j2nfqIkeSPw74DbAarqR1X1XUb03GhWAOcmWQH8NHCMETw/RikMRvJPXrTL2LcBDwBjVXWsLXoOGGvTs/VmWHr234H/Avxzm38T8N2qOtHme4/rlWNuy19q44elF+uAfwT+Z7tt9mdJzmNEz42qOgr8EfAPdEPgJeBhRvD8GKUwGDlJXg98CfhIVb3cu6y617ZD/yhZkv8AvFBVDy/1viwTK4BLgNuq6m3A9+neFnrFqJwbAO2zkS10Q/LNwHn85F7hLMgohcG8/uTFsEjyOrpBcGdV3dPKz7dLfNrPF1p9tt4MQ8/eAfzHJIfp3hp8F/DHdG93TH/Ppve4XjnmtvyNwHcYjl5A9zfWI1X1QJu/m244jOK5AfBrwDNV9Y9V9f+Ae+ieMyN3foxSGIzMn7xo9zBvB56sqs/0LNoLTD/1sRXY01O/rj05shF4qd0y2A9sSrKy/Qa1qdV+YlTVR6tqdVWtpfu/+f+tqmuB+4D3tGEn92K6R+9p46vVr25Pk6wD1gNfW6TDOG2q6jng2SS/2EqXAU8wgudG8w/AxiQ/3f7dTPdj9M6Ppf4EezFfdJ+M+BbdT/r/YKn35wwe5zvpXuY/CjzSXlfSvbd5L3AI+BvggjY+dP/Phf4eOAiM92zrA3Q/DJsE3r/Ux7bAvkzw6tNEb6H7j3US+AvgnFb/qTY/2Za/pWf9P2g9egq4YqmPZwF9uBh4qJ0ff0X3aaCRPTeA/wp8E3gM+F90nwgaufPDbyBLkkbqNpEkaRaGgSTJMJAkGQaSJAwDSRKGgSQJw0CShGEgSQL+BcZ4UAlHAlHmAAAAAElFTkSuQmCC\n", "text/plain": [ "
" ] }, "metadata": { "needs_background": "light" }, "output_type": "display_data" } ], "source": [ "repeated_names[repeated_names.issn_count > 3].issn_count.hist(bins=20)" ] }, { "cell_type": "code", "execution_count": 18, "metadata": {}, "outputs": [ { "data": { "text/plain": [ "" ] }, "execution_count": 18, "metadata": {}, "output_type": "execute_result" }, { "data": { "image/png": "iVBORw0KGgoAAAANSUhEUgAAAYMAAAD4CAYAAAAO9oqkAAAAOXRFWHRTb2Z0d2FyZQBNYXRwbG90bGliIHZlcnNpb24zLjMuMCwgaHR0cHM6Ly9tYXRwbG90bGliLm9yZy86wFpkAAAACXBIWXMAAAsTAAALEwEAmpwYAAAU30lEQVR4nO3dfYxd9Z3f8fendshGkBQI7MiLSe1onVQ87LrLCKg2XQ2hAYdEMaki1ogGO2HjRAEpUV1tzXYl0lAktt1suqgpWydYGDWLg0IIViAlXjcjUqkkNsHlMZSBGMWWsbVAwk4S0Tr77R/3Nzt3vGN7fO94rvG8X9LVPed7Hu7vfNHwmfNwx6kqJEnz2z8Y9AAkSYNnGEiSDANJkmEgScIwkCQBCwc9gF6dccYZtWTJkkEP45j5+c9/zsknnzzoYRwX7MVU9mMq+zFpJr149NFH/7qqzjy4/oYNgyVLlrBjx45BD+OYGR0dZWRkZNDDOC7Yi6nsx1T2Y9JMepHkxenqXiaSJBkGkiTDQJKEYSBJwjCQJGEYSJIwDCRJGAaSJAwDSRJv4G8g92PJ+gcG8rm7bv3AQD5Xko7EMwNJkmEgSTIMJEnMIAySbEyyP8mTXbWvJdnZXruS7Gz1JUl+2bXsL7q2uSDJE0nGktyWJK1+epKtSZ5r76cdg+OUJB3GTM4M7gRWdBeq6veranlVLQfuBb7Rtfj5iWVV9amu+u3AJ4Bl7TWxz/XAtqpaBmxr85KkOXTEMKiqh4FXplvWfru/Crj7cPtIsgh4W1U9UlUF3AVc2RavBDa16U1ddUnSHOn30dJ/Buyrque6akuTPAa8BvxxVX0POAvY3bXO7lYDGKqqvW36JWDoUB+WZC2wFmBoaIjR0dGeBr3u/AM9bdevoxnv+Ph4z8d3orEXU9mPqezHpH560W8YXM3Us4K9wDuq6uUkFwDfTHLuTHdWVZWkDrN8A7ABYHh4uHr9143WDOp7BteMzHhd//WmSfZiKvsxlf2Y1E8veg6DJAuBfwFcMFGrqteB19v0o0meB94F7AEWd22+uNUA9iVZVFV72+Wk/b2OSZLUm34eLf3nwI+q6u8u/yQ5M8mCNv1OOjeKX2iXgV5LcnG7z3AtcH/bbAuwuk2v7qpLkubITB4tvRv4X8C7k+xOcl1btIq/f+P494DH26OmXwc+VVUTN58/DXwFGAOeB77d6rcC70vyHJ2AubX3w5Ek9eKIl4mq6upD1NdMU7uXzqOm062/AzhvmvrLwKVHGock6djxG8iSJMNAkmQYSJIwDCRJGAaSJAwDSRKGgSQJw0CShGEgScIwkCRhGEiSMAwkSRgGkiQMA0kShoEkCcNAkoRhIEnCMJAkYRhIkphBGCTZmGR/kie7ap9LsifJzva6omvZjUnGkjyb5PKu+opWG0uyvqu+NMn3W/1rSU6azQOUJB3ZTM4M7gRWTFP/YlUtb68HAZKcA6wCzm3b/JckC5IsAL4EvB84B7i6rQvwJ21fvwm8ClzXzwFJko7eEcOgqh4GXpnh/lYCm6vq9ar6MTAGXNheY1X1QlX9X2AzsDJJgPcCX2/bbwKuPLpDkCT1q597BjckebxdRjqt1c4CftK1zu5WO1T97cBPq+rAQXVJ0hxa2ON2twM3A9XevwB8fLYGdShJ1gJrAYaGhhgdHe1pP+vOP3DklY6Boxnv+Ph4z8d3orEXU9mPqezHpH560VMYVNW+iekkXwa+1Wb3AGd3rbq41ThE/WXg1CQL29lB9/rTfe4GYAPA8PBwjYyM9DJ81qx/oKft+rXrmpEZrzs6Okqvx3eisRdT2Y+p7MekfnrR02WiJIu6Zj8MTDxptAVYleTNSZYCy4AfANuBZe3JoZPo3GTeUlUFfBf4SNt+NXB/L2OSJPXuiGcGSe4GRoAzkuwGbgJGkiync5loF/BJgKp6Ksk9wNPAAeD6qvpV288NwEPAAmBjVT3VPuLfAJuT/HvgMeCO2To4SdLMHDEMqurqacqH/B92Vd0C3DJN/UHgwWnqL9B52kiSNCB+A1mSZBhIkgwDSRKGgSQJw0CShGEgScIwkCRhGEiSMAwkSRgGkiQMA0kShoEkCcNAkoRhIEnCMJAkYRhIkjAMJEkYBpIkDANJEoaBJIkZhEGSjUn2J3myq/Yfk/woyeNJ7ktyaqsvSfLLJDvb6y+6trkgyRNJxpLcliStfnqSrUmea++nHYPjlCQdxkzODO4EVhxU2wqcV1W/Bfwf4MauZc9X1fL2+lRX/XbgE8Cy9prY53pgW1UtA7a1eUnSHDpiGFTVw8ArB9W+U1UH2uwjwOLD7SPJIuBtVfVIVRVwF3BlW7wS2NSmN3XVJUlzZOEs7OPjwNe65pcmeQx4DfjjqvoecBawu2ud3a0GMFRVe9v0S8DQoT4oyVpgLcDQ0BCjo6M9DXjd+QeOvNIxcDTjHR8f7/n4TjT2Yir7MZX9mNRPL/oKgyT/FjgAfLWV9gLvqKqXk1wAfDPJuTPdX1VVkjrM8g3ABoDh4eEaGRnpadxr1j/Q03b92nXNyIzXHR0dpdfjO9HYi6nsx1T2Y1I/veg5DJKsAT4IXNou/VBVrwOvt+lHkzwPvAvYw9RLSYtbDWBfkkVVtbddTtrf65gkSb3p6dHSJCuAPwQ+VFW/6KqfmWRBm34nnRvFL7TLQK8lubg9RXQtcH/bbAuwuk2v7qpLkubIEc8MktwNjABnJNkN3ETn6aE3A1vbE6KPtCeHfg/4fJL/B/wt8Kmqmrj5/Gk6Tya9Bfh2ewHcCtyT5DrgReCqWTkySdKMHTEMqurqacp3HGLde4F7D7FsB3DeNPWXgUuPNA5J0rHjN5AlSYaBJMkwkCRhGEiSMAwkSRgGkiQMA0kShoEkCcNAkoRhIEnCMJAkYRhIkjAMJEkYBpIkDANJEoaBJAnDQJKEYSBJwjCQJDHDMEiyMcn+JE921U5PsjXJc+39tFZPktuSjCV5PMnvdG2zuq3/XJLVXfULkjzRtrktSWbzICVJhzfTM4M7gRUH1dYD26pqGbCtzQO8H1jWXmuB26ETHsBNwEXAhcBNEwHS1vlE13YHf5Yk6RiaURhU1cPAKweVVwKb2vQm4Mqu+l3V8QhwapJFwOXA1qp6papeBbYCK9qyt1XVI1VVwF1d+5IkzYF+7hkMVdXeNv0SMNSmzwJ+0rXe7lY7XH33NHVJ0hxZOBs7qapKUrOxr8NJspbOpSeGhoYYHR3taT/rzj8wi6OauaMZ7/j4eM/Hd6KxF1PZj6nsx6R+etFPGOxLsqiq9rZLPftbfQ9wdtd6i1ttDzByUH201RdPs/7fU1UbgA0Aw8PDNTIyMt1qR7Rm/QM9bdevXdeMzHjd0dFRej2+E429mMp+TGU/JvXTi34uE20BJp4IWg3c31W/tj1VdDHws3Y56SHgsiSntRvHlwEPtWWvJbm4PUV0bde+JElzYEZnBknupvNb/RlJdtN5KuhW4J4k1wEvAle11R8ErgDGgF8AHwOoqleS3Axsb+t9vqombkp/ms4TS28Bvt1ekqQ5MqMwqKqrD7Ho0mnWLeD6Q+xnI7BxmvoO4LyZjEWSNPv8BrIkyTCQJBkGkiQMA0kShoEkCcNAkoRhIEnCMJAkYRhIkjAMJEkYBpIkDANJEoaBJAnDQJKEYSBJwjCQJGEYSJIwDCRJGAaSJAwDSRJ9hEGSdyfZ2fV6Lclnk3wuyZ6u+hVd29yYZCzJs0ku76qvaLWxJOv7PShJ0tFZ2OuGVfUssBwgyQJgD3Af8DHgi1X1p93rJzkHWAWcC/wG8FdJ3tUWfwl4H7Ab2J5kS1U93evYJElHp+cwOMilwPNV9WKSQ62zEthcVa8DP04yBlzYlo1V1QsASTa3dQ0DSZojsxUGq4C7u+ZvSHItsANYV1WvAmcBj3Sts7vVAH5yUP2i6T4kyVpgLcDQ0BCjo6M9DXbd+Qd62q5fRzPe8fHxno/vRGMvprIfU9mPSf30ou8wSHIS8CHgxla6HbgZqPb+BeDj/X4OQFVtADYADA8P18jISE/7WbP+gdkYzlHbdc3IjNcdHR2l1+M70diLqezHVPZjUj+9mI0zg/cDP6yqfQAT7wBJvgx8q83uAc7u2m5xq3GYuiRpDszGo6VX03WJKMmirmUfBp5s01uAVUnenGQpsAz4AbAdWJZkaTvLWNXWlSTNkb7ODJKcTOcpoE92lf9DkuV0LhPtmlhWVU8luYfOjeEDwPVV9au2nxuAh4AFwMaqeqqfcUmSjk5fYVBVPwfeflDto4dZ/xbglmnqDwIP9jMWSVLv/AayJMkwkCQZBpIkDANJEoaBJAnDQJKEYSBJwjCQJGEYSJIwDCRJGAaSJAwDSRKGgSQJw0CShGEgScIwkCRhGEiSMAwkSRgGkiQMA0kSsxAGSXYleSLJziQ7Wu30JFuTPNfeT2v1JLktyViSx5P8Ttd+Vrf1n0uyut9xSZJmbrbODC6pquVVNdzm1wPbqmoZsK3NA7wfWNZea4HboRMewE3ARcCFwE0TASJJOvaO1WWilcCmNr0JuLKrfld1PAKcmmQRcDmwtapeqapXga3AimM0NknSQRbOwj4K+E6SAv5rVW0Ahqpqb1v+EjDUps8CftK17e5WO1R9iiRr6ZxRMDQ0xOjoaE8DXnf+gZ6269fRjHd8fLzn4zvR2Iup7MdU9mNSP72YjTB4T1XtSfLrwNYkP+peWFXVgqJvLWg2AAwPD9fIyEhP+1mz/oHZGM5R23XNyIzXHR0dpdfjO9HYi6nsx1T2Y1I/vej7MlFV7Wnv+4H76Fzz39cu/9De97fV9wBnd22+uNUOVZckzYG+wiDJyUneOjENXAY8CWwBJp4IWg3c36a3ANe2p4ouBn7WLic9BFyW5LR24/iyVpMkzYF+LxMNAfclmdjXX1bVf0+yHbgnyXXAi8BVbf0HgSuAMeAXwMcAquqVJDcD29t6n6+qV/ocmyRphvoKg6p6AfjtaeovA5dOUy/g+kPsayOwsZ/xSJJ64zeQJUmGgSTJMJAkYRhIkjAMJEkYBpIkDANJEoaBJAnDQJKEYSBJwjCQJGEYSJIwDCRJGAaSJAwDSRKGgSQJw0CShGEgScIwkCTRRxgkOTvJd5M8neSpJJ9p9c8l2ZNkZ3td0bXNjUnGkjyb5PKu+opWG0uyvr9DkiQdrYV9bHsAWFdVP0zyVuDRJFvbsi9W1Z92r5zkHGAVcC7wG8BfJXlXW/wl4H3AbmB7ki1V9XQfY5MkHYWew6Cq9gJ72/TfJHkGOOswm6wENlfV68CPk4wBF7ZlY1X1AkCSzW1dw0CS5kiqqv+dJEuAh4HzgH8FrAFeA3bQOXt4Ncl/Bh6pqv/WtrkD+HbbxYqq+oNW/yhwUVXdMM3nrAXWAgwNDV2wefPmnsb7xJ6f9bTdXBp6C+z75ezt7/yz/uHs7WyOjY+Pc8oppwx6GMcN+zGV/Zg0k15ccsklj1bV8MH1fi4TAZDkFOBe4LNV9VqS24GbgWrvXwA+3u/nAFTVBmADwPDwcI2MjPS0nzXrH5iN4RxT684/wBee6Ps/z9/Zdc3IrO1rro2OjtLrf+sTkf2Yyn5M6qcXff3fJsmb6ATBV6vqGwBVta9r+ZeBb7XZPcDZXZsvbjUOU5ckzYF+niYKcAfwTFX9WVd9UddqHwaebNNbgFVJ3pxkKbAM+AGwHViWZGmSk+jcZN7S67gkSUevnzOD3wU+CjyRZGer/RFwdZLldC4T7QI+CVBVTyW5h86N4QPA9VX1K4AkNwAPAQuAjVX1VB/jkiQdpX6eJvqfQKZZ9OBhtrkFuGWa+oOH206SdGz5DWRJkmEgSTIMJEkYBpIkDANJEoaBJAnDQJKEYSBJYhb+UJ3eGJYM6I/z7br1AwP5XElHxzMDSZJhIEkyDCRJGAaSJAwDSRKGgSQJw0CShN8z0DE2G99vWHf+Adb0sB+/4yDNnGcGkiTPDHTi8lvX0sx5ZiBJOn7ODJKsAP4cWAB8papuHfCQpJ4c6zOSw91D8axEvTouwiDJAuBLwPuA3cD2JFuq6unBjkx6Y/HSmHp1XIQBcCEwVlUvACTZDKwEDAPpDWBQIQS9P232RnWsgjdVdUx2fFSDSD4CrKiqP2jzHwUuqqobDlpvLbC2zb4beHZOBzq3zgD+etCDOE7Yi6nsx1T2Y9JMevGPqurMg4vHy5nBjFTVBmDDoMcxF5LsqKrhQY/jeGAvprIfU9mPSf304nh5mmgPcHbX/OJWkyTNgeMlDLYDy5IsTXISsArYMuAxSdK8cVxcJqqqA0luAB6i82jpxqp6asDDGrR5cTlshuzFVPZjKvsxqedeHBc3kCVJg3W8XCaSJA2QYSBJMgwGLcnGJPuTPNlVOz3J1iTPtffTBjnGuZTk7CTfTfJ0kqeSfKbV511Pkvxakh8k+d+tF/+u1Zcm+X6SsSRfaw9dzBtJFiR5LMm32vy87UeSXUmeSLIzyY5W6+lnxTAYvDuBFQfV1gPbqmoZsK3NzxcHgHVVdQ5wMXB9knOYnz15HXhvVf02sBxYkeRi4E+AL1bVbwKvAtcNbogD8Rngma75+d6PS6pqedf3C3r6WTEMBqyqHgZeOai8EtjUpjcBV87lmAapqvZW1Q/b9N/Q+aE/i3nYk+oYb7Nvaq8C3gt8vdXnRS8mJFkMfAD4SpsP87gfh9DTz4phcHwaqqq9bfolYGiQgxmUJEuAfwJ8n3nak3ZJZCewH9gKPA/8tKoOtFV20wnL+eI/AX8I/G2bfzvzux8FfCfJo+3P9UCPPyvHxfcMdGhVVUnm3fO/SU4B7gU+W1WvdX4B7JhPPamqXwHLk5wK3Af848GOaHCSfBDYX1WPJhkZ8HCOF++pqj1Jfh3YmuRH3QuP5mfFM4Pj074kiwDa+/4Bj2dOJXkTnSD4alV9o5XndU+q6qfAd4F/CpyaZOIXufn0p1t+F/hQkl3AZjqXh/6c+dsPqmpPe99P55eFC+nxZ8UwOD5tAVa36dXA/QMcy5xq14DvAJ6pqj/rWjTvepLkzHZGQJK30Pn3Pp6hEwofaavNi14AVNWNVbW4qpbQ+ZM1/6OqrmGe9iPJyUneOjENXAY8SY8/K34DecCS3A2M0PnTs/uAm4BvAvcA7wBeBK6qqoNvMp+QkrwH+B7wBJPXhf+Izn2DedWTJL9F5wbgAjq/uN1TVZ9P8k46vxmfDjwG/Muqen1wI5177TLRv66qD87XfrTjvq/NLgT+sqpuSfJ2evhZMQwkSV4mkiQZBpIkDANJEoaBJAnDQJKEYSBJwjCQJAH/H5yhSM0MW4wRAAAAAElFTkSuQmCC\n", "text/plain": [ "
" ] }, "metadata": { "needs_background": "light" }, "output_type": "display_data" } ], "source": [ "repeated_names[(repeated_names.issn_count > 3) & (repeated_names.issn_count < 50)].issn_count.hist(bins=10)" ] }, { "cell_type": "code", "execution_count": 19, "metadata": {}, "outputs": [ { "data": { "text/plain": [ "" ] }, "execution_count": 19, "metadata": {}, "output_type": "execute_result" }, { "data": { "image/png": "iVBORw0KGgoAAAANSUhEUgAAAYMAAAD4CAYAAAAO9oqkAAAAOXRFWHRTb2Z0d2FyZQBNYXRwbG90bGliIHZlcnNpb24zLjMuMCwgaHR0cHM6Ly9tYXRwbG90bGliLm9yZy86wFpkAAAACXBIWXMAAAsTAAALEwEAmpwYAAAVG0lEQVR4nO3cf5Bd9X3e8fdTVMwPxQhCuqWSpmJqxRkMsQtbIPU0XUEGhPFY/OF48NBYOGo0TbFDXKY2JJPS2mYGx6YE09gdjVGREw0yoU6lsbGxBrP1ZKbCGOMgfpiwBRmkYuRYQCqDTeR++sf9Ml6WXbR772rvUfR+zezsPd/zPec+V9Lq2fPj3lQVkqQj298bdgBJ0vBZBpIky0CSZBlIkrAMJEnAomEH6NfJJ59cK1as6GvbH/3oRxx//PHzG2iedT1j1/NB9zN2PR90P2PX80H3Mt5///1/XVW/8JoVVXVYfp111lnVr3vuuafvbRdK1zN2PV9V9zN2PV9V9zN2PV9V9zIC36pp/k/1NJEkyTKQJFkGkiQsA0kSloEkCctAkoRlIEnCMpAkYRlIkjiMP45iEDv3vMDlV395wZ931/UXL/hzStJseGQgSbIMJEmWgSQJy0CShGUgScIykCQxizJIsjHJ3iQPTRr7ZJLvJnkwyZ8nWTJp3TVJJpI8luTCSeOr29hEkqsnjZ+a5N42/oUkR8/j65MkzcJsjgxuBVZPGdsOnF5Vvwz8FXANQJLTgEuBt7RtPpPkqCRHAX8MXAScBry3zQX4BHBjVb0JeA5YN9ArkiTN2UHLoKq+AeybMva1qjrQFncAy9rjNcCWqvpJVT0JTABnt6+Jqnqiql4GtgBrkgQ4D7ijbb8JuGSwlyRJmqv5eAfybwJfaI+X0iuHV+xuYwBPTxk/B/h54PlJxTJ5/mskWQ+sBxgZGWF8fLyvwCPHwlVnHDj4xHk2l7z79+/v+/UthK7ng+5n7Ho+6H7GrueDwyMjDFgGSX4fOABsnp84r6+qNgAbAEZHR2tsbKyv/dy8eSs37Fz4T+LYddnYrOeOj4/T7+tbCF3PB93P2PV80P2MXc8Hh0dGGKAMklwOvBM4v6qqDe8Blk+atqyNMcP4D4ElSRa1o4PJ8yVJC6SvW0uTrAY+DLyrql6ctGobcGmSNyQ5FVgJfBO4D1jZ7hw6mt5F5m2tRO4B3t22Xwts7e+lSJL6NZtbS28D/hfw5iS7k6wD/gvwc8D2JN9J8l8Bquph4HbgEeCrwBVV9dP2W/8HgLuAR4Hb21yAjwD/LskEvWsIt8zrK5QkHdRBTxNV1XunGZ7xP+yqug64bprxO4E7pxl/gt7dRpKkIfEdyJIky0CSZBlIkrAMJElYBpIkLANJEpaBJAnLQJKEZSBJwjKQJGEZSJKwDCRJWAaSJCwDSRKWgSQJy0CShGUgScIykCRhGUiSsAwkSVgGkiQsA0kSloEkiVmUQZKNSfYmeWjS2ElJtid5vH0/sY0nyaeTTCR5MMmZk7ZZ2+Y/nmTtpPGzkuxs23w6Seb7RUqSXt9sjgxuBVZPGbsauLuqVgJ3t2WAi4CV7Ws98FnolQdwLXAOcDZw7SsF0ub81qTtpj6XJOkQO2gZVNU3gH1ThtcAm9rjTcAlk8Y/Xz07gCVJTgEuBLZX1b6qeg7YDqxu695YVTuqqoDPT9qXJGmB9HvNYKSqnmmPvw+MtMdLgacnzdvdxl5vfPc045KkBbRo0B1UVSWp+QhzMEnW0zv9xMjICOPj433tZ+RYuOqMA/OYbHbmknf//v19v76F0PV80P2MXc8H3c/Y9XxweGSE/svg2SSnVNUz7VTP3ja+B1g+ad6yNrYHGJsyPt7Gl00zf1pVtQHYADA6OlpjY2MzTX1dN2/eyg07B+7BOdt12dis546Pj9Pv61sIXc8H3c/Y9XzQ/YxdzweHR0bo/zTRNuCVO4LWAlsnjb+v3VV0LvBCO510F3BBkhPbheMLgLvaur9Jcm67i+h9k/YlSVogB/31OMlt9H6rPznJbnp3BV0P3J5kHfA94D1t+p3AO4AJ4EXg/QBVtS/Jx4D72ryPVtUrF6X/Lb07lo4FvtK+JEkL6KBlUFXvnWHV+dPMLeCKGfazEdg4zfi3gNMPlkOSdOj4DmRJkmUgSbIMJElYBpIkLANJEpaBJAnLQJKEZSBJwjKQJGEZSJKwDCRJWAaSJCwDSRKWgSQJy0CShGUgScIykCRhGUiSsAwkSVgGkiQsA0kSloEkCctAkoRlIEliwDJI8qEkDyd5KMltSY5JcmqSe5NMJPlCkqPb3De05Ym2fsWk/VzTxh9LcuGAr0mSNEd9l0GSpcDvAKNVdTpwFHAp8Angxqp6E/AcsK5tsg54ro3f2OaR5LS23VuA1cBnkhzVby5J0twNeppoEXBskkXAccAzwHnAHW39JuCS9nhNW6atPz9J2viWqvpJVT0JTABnD5hLkjQHqar+N06uBK4DXgK+BlwJ7Gi//ZNkOfCVqjo9yUPA6qra3db9b+Ac4D+2bf60jd/StrljmudbD6wHGBkZOWvLli195d677wWefamvTQdyxtITZj13//79LF68+BCmGUzX80H3M3Y9H3Q/Y9fzQfcyrlq16v6qGp06vqjfHSY5kd5v9acCzwN/Ru80zyFTVRuADQCjo6M1NjbW135u3ryVG3b2/dL7tuuysVnPHR8fp9/XtxC6ng+6n7Hr+aD7GbueDw6PjDDYaaJfA56sqh9U1d8CXwTeDixpp40AlgF72uM9wHKAtv4E4IeTx6fZRpK0AAYpg6eAc5Mc1879nw88AtwDvLvNWQtsbY+3tWXa+q9X7xzVNuDSdrfRqcBK4JsD5JIkzVHf50qq6t4kdwDfBg4AD9A7hfNlYEuSj7exW9omtwB/kmQC2EfvDiKq6uEkt9MrkgPAFVX1035zSZLmbqAT51V1LXDtlOEnmOZuoKr6MfDrM+znOnoXoiVJQ+A7kCVJloEkyTKQJGEZSJKwDCRJWAaSJCwDSRKWgSQJy0CShGUgScIykCRhGUiSsAwkSVgGkiQsA0kSloEkCctAkoRlIEnCMpAkYRlIkrAMJElYBpIkLANJEpaBJIkByyDJkiR3JPlukkeT/EqSk5JsT/J4+35im5skn04ykeTBJGdO2s/aNv/xJGsHfVGSpLkZ9MjgJuCrVfVLwFuBR4GrgburaiVwd1sGuAhY2b7WA58FSHIScC1wDnA2cO0rBSJJWhh9l0GSE4BfBW4BqKqXq+p5YA2wqU3bBFzSHq8BPl89O4AlSU4BLgS2V9W+qnoO2A6s7jeXJGnuUlX9bZi8DdgAPELvqOB+4EpgT1UtaXMCPFdVS5J8Cbi+qv6irbsb+AgwBhxTVR9v438AvFRVn5rmOdfTO6pgZGTkrC1btvSVfe++F3j2pb42HcgZS0+Y9dz9+/ezePHiQ5hmMF3PB93P2PV80P2MXc8H3cu4atWq+6tqdOr4ogH2uQg4E/hgVd2b5CZ+dkoIgKqqJP21zTSqagO9AmJ0dLTGxsb62s/Nm7dyw85BXnp/dl02Nuu54+Pj9Pv6FkLX80H3M3Y9H3Q/Y9fzweGREQa7ZrAb2F1V97blO+iVw7Pt9A/t+962fg+wfNL2y9rYTOOSpAXSdxlU1feBp5O8uQ2dT++U0TbglTuC1gJb2+NtwPvaXUXnAi9U1TPAXcAFSU5sF44vaGOSpAUy6LmSDwKbkxwNPAG8n17B3J5kHfA94D1t7p3AO4AJ4MU2l6ral+RjwH1t3kerat+AuSRJczBQGVTVd4DXXIigd5QwdW4BV8ywn43AxkGySJL65zuQJUmWgSTJMpAkYRlIkrAMJElYBpIkLANJEpaBJAnLQJKEZSBJwjKQJGEZSJKwDCRJWAaSJCwDSRKWgSQJy0CShGUgScIykCRhGUiSsAwkSVgGkiQsA0kSloEkiXkogyRHJXkgyZfa8qlJ7k0ykeQLSY5u429oyxNt/YpJ+7imjT+W5MJBM0mS5mY+jgyuBB6dtPwJ4MaqehPwHLCuja8DnmvjN7Z5JDkNuBR4C7Aa+EySo+YhlyRplgYqgyTLgIuBz7XlAOcBd7Qpm4BL2uM1bZm2/vw2fw2wpap+UlVPAhPA2YPkkiTNzaIBt/8j4MPAz7Xlnweer6oDbXk3sLQ9Xgo8DVBVB5K80OYvBXZM2ufkbV4lyXpgPcDIyAjj4+N9hR45Fq4648DBJ86zueTdv39/369vIXQ9H3Q/Y9fzQfczdj0fHB4ZYYAySPJOYG9V3Z9kbN4SvY6q2gBsABgdHa2xsf6e9ubNW7lh56A9OHe7Lhub9dzx8XH6fX0Loev5oPsZu54Pup+x6/ng8MgIgx0ZvB14V5J3AMcAbwRuApYkWdSODpYBe9r8PcByYHeSRcAJwA8njb9i8jaSpAXQ9zWDqrqmqpZV1Qp6F4C/XlWXAfcA727T1gJb2+NtbZm2/utVVW380na30anASuCb/eaSJM3doThX8hFgS5KPAw8At7TxW4A/STIB7KNXIFTVw0luBx4BDgBXVNVPD0EuSdIM5qUMqmocGG+Pn2Cau4Gq6sfAr8+w/XXAdfORRZI0d74DWZJkGUiSLANJEpaBJAnLQJKEZSBJwjKQJGEZSJKwDCRJWAaSJCwDSRKH5oPq1EErrv7yvO7vqjMOcPks9rnr+ovn9XklHRoeGUiSLANJkmUgScIykCRhGUiSsAwkSXhr6YKay+2ds711U5Lmg0cGkiTLQJJkGUiSsAwkSVgGkiQGKIMky5Pck+SRJA8nubKNn5Rke5LH2/cT23iSfDrJRJIHk5w5aV9r2/zHk6wd/GVJkuZikCODA8BVVXUacC5wRZLTgKuBu6tqJXB3Wwa4CFjZvtYDn4VeeQDXAucAZwPXvlIgkqSF0XcZVNUzVfXt9vj/Ao8CS4E1wKY2bRNwSXu8Bvh89ewAliQ5BbgQ2F5V+6rqOWA7sLrfXJKkuUtVDb6TZAXwDeB04KmqWtLGAzxXVUuSfAm4vqr+oq27G/gIMAYcU1Ufb+N/ALxUVZ+a5nnW0zuqYGRk5KwtW7b0lXfvvhd49qW+Nl0wI8fS6YyzzXfG0hMOfZgZ7N+/n8WLFw/t+Q+m6/mg+xm7ng+6l3HVqlX3V9Xo1PGB34GcZDHw34Hfraq/6f3/31NVlWTwtvnZ/jYAGwBGR0drbGysr/3cvHkrN+zs9puvrzrjQKczzjbfrsvGDn2YGYyPj9Pvv5GF0PV80P2MXc8Hh0dGGPBuoiR/n14RbK6qL7bhZ9vpH9r3vW18D7B80ubL2thM45KkBTLI3UQBbgEerar/PGnVNuCVO4LWAlsnjb+v3VV0LvBCVT0D3AVckOTEduH4gjYmSVogg5yHeDvwG8DOJN9pY78HXA/cnmQd8D3gPW3dncA7gAngReD9AFW1L8nHgPvavI9W1b4BckmS5qjvMmgXgjPD6vOnmV/AFTPsayOwsd8skqTB+A5kSZJlIEmyDCRJWAaSJCwDSRKWgSQJy0CSxDx8NpHUVTv3vMDlV395wZ931/UXL/hzSoPyyECSZBlIkiwDSRKWgSQJLyDrEFsxhAu4r7jqjKE9tXTY8chAkmQZSJIsA0kSXjOQ/k6Z72s0V51xYFZv3PONdoc/jwwkSZaBJMnTRNK8m+2pmtmegpEWgmUg6bA1rA8jhL9710k8TSRJ8shAkvoxrNOBh+qIxDKQNLBhfeyIHzkyfzpzmijJ6iSPJZlIcvWw80jSkaQTZZDkKOCPgYuA04D3JjltuKkk6cjRiTIAzgYmquqJqnoZ2AKsGXImSTpipKqGnYEk7wZWV9W/bsu/AZxTVR+YMm89sL4tvhl4rM+nPBn46z63XShdz9j1fND9jF3PB93P2PV80L2M/7iqfmHq4GF1AbmqNgAbBt1Pkm9V1eg8RDpkup6x6/mg+xm7ng+6n7Hr+eDwyAjdOU20B1g+aXlZG5MkLYCulMF9wMokpyY5GrgU2DbkTJJ0xOjEaaKqOpDkA8BdwFHAxqp6+BA+5cCnmhZA1zN2PR90P2PX80H3M3Y9HxweGbtxAVmSNFxdOU0kSRoiy0CSdOSVQZKjkjyQ5EvDzjKdJEuS3JHku0keTfIrw840VZIPJXk4yUNJbktyzJDzbEyyN8lDk8ZOSrI9yePt+4kdzPjJ9vf8YJI/T7JkiBGnzThp3VVJKsnJw8jWMkybL8kH25/jw0n+cFj5Wpbp/p7flmRHku8k+VaSs4eZcSZHXBkAVwKPDjvE67gJ+GpV/RLwVjqWNclS4HeA0ao6nd4F/0uHm4pbgdVTxq4G7q6qlcDdbXmYbuW1GbcDp1fVLwN/BVyz0KGmuJXXZiTJcuAC4KmFDjTFrUzJl2QVvU8reGtVvQX41BByTXYrr/0z/EPgP1XV24D/0JY754gqgyTLgIuBzw07y3SSnAD8KnALQFW9XFXPDzXU9BYBxyZZBBwH/J9hhqmqbwD7pgyvATa1x5uASxYy01TTZayqr1XVgba4g977a4Zmhj9HgBuBDwNDvdtkhny/DVxfVT9pc/YueLBJZshYwBvb4xMY8s/LTI6oMgD+iN4/6v835BwzORX4AfDf2qmszyU5ftihJquqPfR++3oKeAZ4oaq+NtxU0xqpqmfa4+8DI8MMMwu/CXxl2CGmSrIG2FNVfznsLDP4ReBfJLk3yf9M8s+GHWgavwt8MsnT9H52hn0EOK0jpgySvBPYW1X3DzvL61gEnAl8tqr+KfAjhn9641Xaufc19IrrHwHHJ/lXw031+qp3/3Rn76FO8vvAAWDzsLNMluQ44PfondroqkXAScC5wL8Hbk+S4UZ6jd8GPlRVy4EP0Y78u+aIKQPg7cC7kuyi96mo5yX50+FGeo3dwO6qurct30GvHLrk14Anq+oHVfW3wBeBfz7kTNN5NskpAO37UE8fzCTJ5cA7gcuqe2/6+Sf0Sv8v28/NMuDbSf7hUFO92m7gi9XzTXpH/UO7yD2DtfR+TgD+jN6nNHfOEVMGVXVNVS2rqhX0Lnh+vao69RttVX0feDrJm9vQ+cAjQ4w0naeAc5Mc134DO5+OXeRuttH7IaR93zrELNNKspreact3VdWLw84zVVXtrKp/UFUr2s/NbuDM9u+0K/4HsAogyS8CR9OtTwiF3jWCf9kenwc8PsQsM+rEx1HoVT4IbG6f0fQE8P4h53mVqro3yR3At+md2niAIb/dPsltwBhwcpLdwLXA9fROGawDvge8Z3gJZ8x4DfAGYHs7s7Gjqv5NlzJWVWdOaczwZ7gR2Nhu5XwZWDvMI6wZMv4WcFO74eLH/Oxj+DvFj6OQJB05p4kkSTOzDCRJloEkyTKQJGEZSJKwDCRJWAaSJOD/A1t+yYNMuQBJAAAAAElFTkSuQmCC\n", "text/plain": [ "
" ] }, "metadata": { "needs_background": "light" }, "output_type": "display_data" } ], "source": [ "repeated_names[(repeated_names.issn_count > 3) & (repeated_names.issn_count < 20)].issn_count.hist(bins=10)" ] }, { "cell_type": "code", "execution_count": 20, "metadata": {}, "outputs": [ { "data": { "text/plain": [ "" ] }, "execution_count": 20, "metadata": {}, "output_type": "execute_result" }, { "data": { "image/png": "iVBORw0KGgoAAAANSUhEUgAAAX0AAAD4CAYAAAAAczaOAAAAOXRFWHRTb2Z0d2FyZQBNYXRwbG90bGliIHZlcnNpb24zLjMuMCwgaHR0cHM6Ly9tYXRwbG90bGliLm9yZy86wFpkAAAACXBIWXMAAAsTAAALEwEAmpwYAAAYmklEQVR4nO3df4xc5X3v8fendgDHTm0D6VxkW11LsahIfEPNChylisa4MQYijHQJdeSGBbna3is3PypLjblSrnX5ITlSKAlWQ7vCbkzqxnF9y7UvoaErw6i3f0DAQHGAIG/ABK+InbK2cxcIuZt++8c8SybbXc+Z3bOzP57PS1rNOc95zjPP9zz2d848c2aOIgIzM8vDb0x1B8zMrH2c9M3MMuKkb2aWESd9M7OMOOmbmWVk7lR34Fwuvvji6OjoGPf+b731FvPnzy+vQ1NktsQBjmU6mi1xgGMZduTIkX+NiA+Otm1aJ/2Ojg6efvrpce9fq9WoVqvldWiKzJY4wLFMR7MlDnAswyS9NtY2T++YmWXESd/MLCNO+mZmGXHSNzPLiJO+mVlGnPTNzDLipG9mlhEnfTOzjDjpm5llZFp/I3eijvaf5dZt32378x7fcX3bn9PMrAif6ZuZZcRJ38wsI076ZmYZcdI3M8uIk76ZWUac9M3MMuKkb2aWESd9M7OMFEr6kv5U0guSfiDp25IukLRc0pOS+iR9R9J5qe75ab0vbe9oaOf2VP6ypGsmKSYzMxtD06QvaQnweaAzIj4CzAE2Al8B7o2IDwGngc1pl83A6VR+b6qHpMvSfh8G1gPfkDSn3HDMzOxcik7vzAXmSZoLvB94A7gaOJC27wFuTMsb0jpp+1pJSuX7IuLdiHgV6AOunHAEZmZWWNPf3omIfklfBX4MvAP8I3AEOBMRQ6naCWBJWl4CvJ72HZJ0FrgolT/R0HTjPu+R1A10A1QqFWq1WutRJZV5sHXlUPOKJZtIn0czODhYeptTxbFMP7MlDnAsRTRN+pIWUz9LXw6cAf6O+vTMpIiIHqAHoLOzM6rV6rjb2rn3IPccbf9vyh3fVC21vVqtxkSOw3TiWKaf2RIHOJYiikzv/D7wakT8NCL+P/D3wMeBRWm6B2Ap0J+W+4FlAGn7QuDNxvJR9jEzszYokvR/DKyW9P40N78WeBF4HLgp1ekCDqblQ2mdtP2xiIhUvjFd3bMcWAF8v5wwzMysiCJz+k9KOgA8AwwBz1KffvkusE/SXalsV9plF/AtSX3AAPUrdoiIFyTtp/6CMQRsiYhflhyPmZmdQ6EJ74jYDmwfUfwKo1x9ExE/Bz49Rjt3A3e32EczMyuJv5FrZpYRJ30zs4w46ZuZZcRJ38wsI076ZmYZcdI3M8uIk76ZWUac9M3MMuKkb2aWESd9M7OMOOmbmWXESd/MLCNO+mZmGXHSNzPLiJO+mVlGmiZ9SZdKeq7h72eSvijpQkm9ko6lx8WpviTdJ6lP0vOSVjW01ZXqH5PUNfazmpnZZGia9CPi5Yi4PCIuB64A3gYeArYBhyNiBXA4rQNcS/1WiCuAbuB+AEkXUr8Ry1XUb76yffiFwszM2qPV6Z21wI8i4jVgA7Anle8BbkzLG4AHo+4J6jdQvwS4BuiNiIGIOA30AusnGoCZmRVX6HaJDTYC307LlYh4Iy3/BKik5SXA6w37nEhlY5X/Gknd1N8hUKlUqNVqLXbxVyrzYOvKoXHvP14T6fNoBgcHS29zqjiW6We2xAGOpYjCSV/SecANwO0jt0VESIoyOhQRPdRvvE5nZ2dUq9Vxt7Vz70HuOdrq69rEHd9ULbW9Wq3GRI7DdOJYpp/ZEgc4liJamd65FngmIk6m9ZNp2ob0eCqV9wPLGvZbmsrGKjczszZpJel/hl9N7QAcAoavwOkCDjaU35Ku4lkNnE3TQI8C6yQtTh/grktlZmbWJoXmPiTNBz4J/HFD8Q5gv6TNwGvAzan8EeA6oI/6lT63AUTEgKQ7gadSvTsiYmDCEZiZWWGFkn5EvAVcNKLsTepX84ysG8CWMdrZDexuvZtmZlYGfyPXzCwjTvpmZhlx0jczy4iTvplZRpz0zcwy4qRvZpYRJ30zs4w46ZuZZcRJ38wsI076ZmYZcdI3M8uIk76ZWUac9M3MMuKkb2aWESd9M7OMFEr6khZJOiDph5JekvQxSRdK6pV0LD0uTnUl6T5JfZKel7SqoZ2uVP+YpK6xn9HMzCZD0TP9rwPfi4jfAT4KvARsAw5HxArgcFqH+r10V6S/buB+AEkXAtuBq4Arge3DLxRmZtYeTZO+pIXAJ4BdABHxi4g4A2wA9qRqe4Ab0/IG4MGoewJYlG6cfg3QGxEDEXEa6AXWlxiLmZk1UeRMfznwU+CvJT0r6YF0z9xKuuE5wE+ASlpeArzesP+JVDZWuZmZtUmRe+TOBVYBn4uIJyV9nV9N5QD1++JKijI6JKmb+rQQlUqFWq027rYq82DryqEyutWSifR5NIODg6W3OVUcy/QzW+IAx1JEkaR/AjgREU+m9QPUk/5JSZdExBtp+uZU2t4PLGvYf2kq6weqI8prI58sInqAHoDOzs6oVqsjqxS2c+9B7jla6N7vpTq+qVpqe7VajYkch+nEsUw/syUOcCxFNJ3eiYifAK9LujQVrQVeBA4Bw1fgdAEH0/Ih4JZ0Fc9q4GyaBnoUWCdpcfoAd10qMzOzNil6Gvw5YK+k84BXgNuov2Dsl7QZeA24OdV9BLgO6APeTnWJiAFJdwJPpXp3RMRAKVGYmVkhhZJ+RDwHdI6yae0odQPYMkY7u4HdLfTPzMxK5G/kmpllxEnfzCwjTvpmZhlx0jczy4iTvplZRpz0zcwy4qRvZpYRJ30zs4w46ZuZZcRJ38wsI076ZmYZcdI3M8uIk76ZWUac9M3MMuKkb2aWESd9M7OMFEr6ko5LOirpOUlPp7ILJfVKOpYeF6dySbpPUp+k5yWtaminK9U/JqlrrOczM7PJ0cqZ/pqIuDwihu+gtQ04HBErgMNpHeBaYEX66wbuh/qLBLAduAq4Etg+/EJhZmbtMZHpnQ3AnrS8B7ixofzBqHsCWCTpEuAaoDciBiLiNNALrJ/A85uZWYtUv6Vtk0rSq8BpIIC/iogeSWciYlHaLuB0RCyS9DCwIyL+OW07DHwJqAIXRMRdqfzLwDsR8dURz9VN/R0ClUrlin379o07uFMDZzn5zrh3H7eVSxaW2t7g4CALFiwotc2p4limn9kSBziWYWvWrDnSMCvzawrdGB34vYjol/RbQK+kHzZujIiQ1PzVo4CI6AF6ADo7O6NarY67rZ17D3LP0aIhluf4pmqp7dVqNSZyHKYTxzL9zJY4wLEUUWh6JyL60+Mp4CHqc/In07QN6fFUqt4PLGvYfWkqG6vczMzapGnSlzRf0geGl4F1wA+AQ8DwFThdwMG0fAi4JV3Fsxo4GxFvAI8C6yQtTh/grktlZmbWJkXmPirAQ/Vpe+YCfxsR35P0FLBf0mbgNeDmVP8R4DqgD3gbuA0gIgYk3Qk8lerdEREDpUViZmZNNU36EfEK8NFRyt8E1o5SHsCWMdraDexuvZtmZlYGfyPXzCwjTvpmZhlx0jczy4iTvplZRpz0zcwy4qRvZpYRJ30zs4w46ZuZZcRJ38wsI076ZmYZcdI3M8uIk76ZWUac9M3MMuKkb2aWESd9M7OMFL6BrKQ5wNNAf0R8StJyYB9wEXAE+GxE/ELS+cCDwBXAm8AfRMTx1MbtwGbgl8DnI8J3zpolOrZ9t1C9rSuHuLVg3SKO77i+tLbMctDKmf4XgJca1r8C3BsRHwJOU0/mpMfTqfzeVA9JlwEbgQ8D64FvpBcSMzNrk0JJX9JS4HrggbQu4GrgQKqyB7gxLW9I66Tta1P9DcC+iHg3Il6lfjvFK0uIwczMCio6vfM14M+AD6T1i4AzETGU1k8AS9LyEuB1gIgYknQ21V8CPNHQZuM+75HUDXQDVCoVarVawS7+R5V59emEdptIn0czODhYeptlK3qcyx6TqTwuM2FcipgtcYBjKaJp0pf0KeBURByRVC29ByNERA/QA9DZ2RnV6vifcufeg9xztPDHFqU5vqlaanu1Wo2JHId2KDpPv3XlUKljUvaxbsVMGJciZksc4FiKKPK/7+PADZKuAy4AfhP4OrBI0tx0tr8U6E/1+4FlwAlJc4GF1D/QHS4f1riPmZm1QdM5/Yi4PSKWRkQH9Q9iH4uITcDjwE2pWhdwMC0fSuuk7Y9FRKTyjZLOT1f+rAC+X1okZmbW1ETeZ38J2CfpLuBZYFcq3wV8S1IfMED9hYKIeEHSfuBFYAjYEhG/nMDzm5lZi1pK+hFRA2pp+RVGufomIn4OfHqM/e8G7m61k2ZmVg5/I9fMLCNO+mZmGXHSNzPLiJO+mVlGnPTNzDLipG9mlhEnfTOzjDjpm5llxEnfzCwjTvpmZhlx0jczy4iTvplZRpz0zcwy4qRvZpYRJ30zs4w0TfqSLpD0fUn/IukFSf8zlS+X9KSkPknfkXReKj8/rfel7R0Nbd2eyl+WdM2kRWVmZqMqcqb/LnB1RHwUuBxYL2k18BXg3oj4EHAa2JzqbwZOp/J7Uz0kXUb9LlofBtYD35A0p8RYzMysiSL3yI2IGEyr70t/AVwNHEjle4Ab0/KGtE7avlaSUvm+iHg3Il4F+hjlzltmZjZ5Cs3pS5oj6TngFNAL/Ag4ExFDqcoJYElaXgK8DpC2nwUuaiwfZR8zM2uDQvfITTcwv1zSIuAh4Hcmq0OSuoFugEqlQq1WG3dblXmwdeVQ84olm0ifRzM4OFh6m2UrepzLHpOpPC4zYVyKmC1xgGMpotUbo5+R9DjwMWCRpLnpbH4p0J+q9QPLgBOS5gILgTcbyoc17tP4HD1AD0BnZ2dUq9WWAmq0c+9B7jnaUoilOL6pWmp7tVqNiRyHdrh123cL1du6cqjUMSn7WLdiJoxLEbMlDnAsRRS5eueD6QwfSfOATwIvAY8DN6VqXcDBtHworZO2PxYRkco3pqt7lgMrgO+XFIeZmRVQ5JTrEmBPutLmN4D9EfGwpBeBfZLuAp4FdqX6u4BvSeoDBqhfsUNEvCBpP/AiMARsSdNGZmbWJk2TfkQ8D/zuKOWvMMrVNxHxc+DTY7R1N3B36900M7My+Bu5ZmYZcdI3M8uIk76ZWUac9M3MMuKkb2aWkfZ/c8nMJqSj4Bfhitq6cqjwl+uO77i+1Oe29vOZvplZRpz0zcwy4qRvZpYRJ30zs4w46ZuZZcRJ38wsI076ZmYZcdI3M8uIk76ZWUac9M3MMlLkdonLJD0u6UVJL0j6Qiq/UFKvpGPpcXEql6T7JPVJel7Sqoa2ulL9Y5K6xnpOMzObHEXO9IeArRFxGbAa2CLpMmAbcDgiVgCH0zrAtdTvf7sC6Abuh/qLBLAduIr6Hbe2D79QmJlZezRN+hHxRkQ8k5b/H/Wboi8BNgB7UrU9wI1peQPwYNQ9ASySdAlwDdAbEQMRcRroBdaXGYyZmZ2bIqJ4ZakD+CfgI8CPI2JRKhdwOiIWSXoY2BER/5y2HQa+BFSBCyLirlT+ZeCdiPjqiOfopv4OgUqlcsW+ffvGHdypgbOcfGfcu4/byiULS21vcHCQBQsWlNpm2Y72ny1UrzKPUsek7GPdiqkal6LHuqhWxmQqj3cRM+H/SlETiWXNmjVHIqJztG2Ff1pZ0gLgfwFfjIif1fN8XUSEpOKvHucQET1AD0BnZ2dUq9Vxt7Vz70HuOdr+X48+vqlaanu1Wo2JHId2KPrTvFtXDpU6JmUf61ZM1bgUPdZFtTImU3m8i5gJ/1eKmqxYCl29I+l91BP+3oj4+1R8Mk3bkB5PpfJ+YFnD7ktT2VjlZmbWJkWu3hGwC3gpIv68YdMhYPgKnC7gYEP5LekqntXA2Yh4A3gUWCdpcfoAd10qMzOzNinynu7jwGeBo5KeS2X/HdgB7Je0GXgNuDltewS4DugD3gZuA4iIAUl3Ak+lendExEAZQZiZWTFNk376QFZjbF47Sv0AtozR1m5gdysdNDOz8vgbuWZmGXHSNzPLSPuvZzQzmyE6Sr48thXfXD9/Utr1mb6ZWUac9M3MMuKkb2aWESd9M7OMOOmbmWXESd/MLCNO+mZmGXHSNzPLiJO+mVlGnPTNzDLipG9mlhEnfTOzjBS5c9ZuSack/aCh7EJJvZKOpcfFqVyS7pPUJ+l5Sasa9ulK9Y9J6hrtuczMbHIVOdP/JrB+RNk24HBErAAOp3WAa4EV6a8buB/qLxLAduAq4Epg+/ALhZmZtU/TpB8R/wSMvK3hBmBPWt4D3NhQ/mDUPQEsSjdNvwbojYiBiDgN9PIfX0jMzGySqX53wyaVpA7g4Yj4SFo/ExGL0rKA0xGxSNLDwI50i0UkHQa+BFSBCyLirlT+ZeCdiPjqKM/VTf1dApVK5Yp9+/aNO7hTA2c5+c64dx+3lUsWltre4OAgCxYsKLXNsh3tP1uoXmUepY5J2ce6FVM1LkWPdVGtjMlUHu8iyh6Tso91K5YvnDPuWNasWXMkIjpH2zbhm6hEREhq/spRvL0eoAegs7MzqtXquNvaufcg9xxt/31ijm+qltperVZjIsehHW4teLOJrSuHSh2Tso91K6ZqXIoe66JaGZOpPN5FlD0mZR/rVnxz/fxJ+fc13qt3TqZpG9LjqVTeDyxrqLc0lY1VbmZmbTTepH8IGL4Cpws42FB+S7qKZzVwNiLeAB4F1klanD7AXZfKzMysjZq+p5P0bepz8hdLOkH9KpwdwH5Jm4HXgJtT9UeA64A+4G3gNoCIGJB0J/BUqndHRIz8cNjMzCZZ06QfEZ8ZY9PaUeoGsGWMdnYDu1vqnZmZlcrfyDUzy4iTvplZRpz0zcwy4qRvZpYRJ30zs4w46ZuZZcRJ38wsI076ZmYZcdI3M8uIk76ZWUac9M3MMuKkb2aWESd9M7OMOOmbmWXESd/MLCNO+mZmGWl70pe0XtLLkvokbWv385uZ5aytSV/SHOAvgGuBy4DPSLqsnX0wM8tZu8/0rwT6IuKViPgFsA/Y0OY+mJllS/Xb2rbpyaSbgPUR8Udp/bPAVRHxJw11uoHutHop8PIEnvJi4F8nsP90MVviAMcyHc2WOMCxDPvtiPjgaBua3hi93SKiB+gpoy1JT0dEZxltTaXZEgc4lulotsQBjqWIdk/v9APLGtaXpjIzM2uDdif9p4AVkpZLOg/YCBxqcx/MzLLV1umdiBiS9CfAo8AcYHdEvDCJT1nKNNE0MFviAMcyHc2WOMCxNNXWD3LNzGxq+Ru5ZmYZcdI3M8vIrEj6kuZIelbSw6NsO1/Sd9LPPjwpqWMKulhYk1hulfRTSc+lvz+aij4WIem4pKOpn0+Psl2S7kvj8rykVVPRzyIKxFKVdLZhXP7HVPSzGUmLJB2Q9ENJL0n62IjtM2lMmsUyU8bk0oY+PifpZ5K+OKJOqeMy7a7TH6cvAC8BvznKts3A6Yj4kKSNwFeAP2hn51p0rlgAvtP4ZbZpbk1EjPXlkmuBFenvKuD+9DhdnSsWgP8bEZ9qW2/G5+vA9yLipnT13PtHbJ9JY9IsFpgBYxIRLwOXw3s/U9MPPDSiWqnjMuPP9CUtBa4HHhijygZgT1o+AKyVpHb0rVUFYplNNgAPRt0TwCJJl0x1p2YrSQuBTwC7ACLiFxFxZkS1GTEmBWOZidYCP4qI10aUlzouMz7pA18D/gz4tzG2LwFeh/olo8BZ4KK29Kx1X+PcsQD8l/QW74CkZeeoN9UC+EdJR9JPa4z03rgkJ1LZdNQsFoCPSfoXSf8g6cPt7FxBy4GfAn+dpg8fkDR/RJ2ZMiZFYoHpPyYjbQS+PUp5qeMyo5O+pE8BpyLiyFT3ZaIKxvJ/gI6I+M9AL796BzMd/V5ErKL+1nSLpE9MdYcmoFksz1D/rZOPAjuB/93m/hUxF1gF3B8Rvwu8BczUnzYvEstMGJP3pCmqG4C/m+znmtFJH/g4cIOk49R/sfNqSX8zos57P/0gaS6wEHiznZ0sqGksEfFmRLybVh8ArmhvF4uLiP70eIr6HOWVI6rMmJ/kaBZLRPwsIgbT8iPA+yRd3PaOntsJ4EREPJnWD1BPnI1mypg0jWWGjEmja4FnIuLkKNtKHZcZnfQj4vaIWBoRHdTfGj0WEX84otohoCst35TqTLtvpBWJZcQ83g3UP/CddiTNl/SB4WVgHfCDEdUOAbekKxNWA2cj4o02d7WpIrFI+k/DnxNJupL6/6tpdWIRET8BXpd0aSpaC7w4otqMGJMiscyEMRnhM4w+tQMlj8tsuXrn10i6A3g6Ig5R/7DnW5L6gAHqCXXGGBHL5yXdAAxRj+XWqezbOVSAh9L/ubnA30bE9yT9V4CI+EvgEeA6oA94G7htivraTJFYbgL+m6Qh4B1g43Q8sQA+B+xNUwmvALfN0DGB5rHMlDEZPpn4JPDHDWWTNi7+GQYzs4zM6OkdMzNrjZO+mVlGnPTNzDLipG9mlhEnfTOzjDjpm5llxEnfzCwj/w4nLu3Z6fE/JAAAAABJRU5ErkJggg==\n", "text/plain": [ "
" ] }, "metadata": { "needs_background": "light" }, "output_type": "display_data" } ], "source": [ "repeated_names[(repeated_names.issn_count > 3) & (repeated_names.issn_count < 8)].issn_count.hist()" ] }, { "cell_type": "code", "execution_count": 21, "metadata": {}, "outputs": [ { "data": { "text/plain": [ "" ] }, "execution_count": 21, "metadata": {}, "output_type": "execute_result" }, { "data": { "image/png": "iVBORw0KGgoAAAANSUhEUgAAAXQAAAD4CAYAAAD8Zh1EAAAAOXRFWHRTb2Z0d2FyZQBNYXRwbG90bGliIHZlcnNpb24zLjMuMCwgaHR0cHM6Ly9tYXRwbG90bGliLm9yZy86wFpkAAAACXBIWXMAAAsTAAALEwEAmpwYAAAWUElEQVR4nO3df7BcdX3G8fdjEiFyaQBjr2mIjQ6MHUtaITuAg3X2Yq0RGPijOA1DlVCd6ygo2jg10SlWZjqiFX+NjpgRKlLLBQHbNGCVIrdgO4L3YuAmBDRCpiYiAYLBxYhGP/1jv1eX293snv1x9/Q7z2tmJ2fP+e7Z55y7ee65Z38pIjAzs///njfsAGZm1h8udDOzTLjQzcwy4UI3M8uEC93MLBMLh3XHS5cujZUrVxa6zTPPPMMRRxwxmEA9Kmu2suaC8mZzruLKmq2suaD7bNPT009ExIuaLoyIoVxWr14dRd1xxx2FbzNfypqtrLkiypvNuYora7ay5oroPhswFS161adczMwy4UI3M8uEC93MLBMudDOzTLjQzcwy4UI3M8tEx4UuaYGk70ra0mTZYZKul7RT0t2SVvY1pZmZtVXkCP0SYEeLZW8BnoqI44BPAB/pNZiZmRXTUaFLOhY4E/hCiyHnANek6RuB10pS7/HMzKxTig6+4ELSjcCHgSOB90bEWXOWbwPWRMTudP0HwCkR8cSccePAOMDo6OjqiYmJQmFrtRojIyPM7Nlf6Hb9tGr5kqbzZ7OVTVlzQXmzOVdxZc1W1lzQfbaxsbHpiKg0W9b2s1wknQXsjYhpSdXC994gIjYBmwAqlUpUq8VWNzk5SbVaZd2GW3qJ0ZNd51ebzp/NVjZlzQXlzeZcxZU1W1lzwWCydXLK5TTgbEm7gAngdEn/NGfMHmAFgKSFwBLgyT7mNDOzNtoWekRsjIhjI2IlsBb4ZkT85Zxhm4EL0vS5aYy/rNTMbB51/fG5ki6j/qlfm4GrgGsl7QT2US9+MzObR4UKPSImgck0fWnD/J8Db+xnMDMzK8bvFDUzy4QL3cwsEy50M7NMuNDNzDLhQjczy4QL3cwsEy50M7NMuNDNzDLhQjczy4QL3cwsEy50M7NMuNDNzDLhQjczy4QL3cwsEy50M7NMuNDNzDLRttAlHS7pHkn3Sdou6UNNxqyT9Likreny1sHENTOzVjr5xqJngdMjoiZpEfAtSV+LiG/PGXd9RFzc/4hmZtaJtoWevuy5lq4uShd/AbSZWcl0dA5d0gJJW4G9wG0RcXeTYX8u6X5JN0pa0c+QZmbWnuoH4B0Olo4Cvgq8MyK2Ncx/IVCLiGclvQ34i4g4vcntx4FxgNHR0dUTExOFwtZqNUZGRpjZs7/Q7fpp1fIlTefPZiubsuaC8mZzruLKmq2suaD7bGNjY9MRUWm2rFChA0i6FPhZRHysxfIFwL6IaN58SaVSiampqUL3PTk5SbVaZeWGWwrdrp92XX5m0/mz2cqmrLmgvNmcq7iyZitrLug+m6SWhd7Jq1xelI7MkbQYeB3w4Jwxyxqung3sKJzSzMx60smrXJYB16Qj7+cBN0TEFkmXAVMRsRl4l6SzgYPAPmDdoAKbmVlznbzK5X7gxCbzL22Y3ghs7G80MzMrwu8UNTPLhAvdzCwTLnQzs0y40M3MMuFCNzPLhAvdzCwTLnQzs0y40M3MMuFCNzPLhAvdzCwTLnQzs0y40M3MMuFCNzPLhAvdzCwTLnQzs0y40M3MMuFCNzPLRCffKXq4pHsk3Sdpu6QPNRlzmKTrJe2UdLeklQNJa2ZmLXVyhP4scHpE/DHwSmCNpFPnjHkL8FREHAd8AvhIX1OamVlbbQs96mrp6qJ0iTnDzgGuSdM3Aq+VpL6lNDOzthQxt5ubDJIWANPAccBnI+J9c5ZvA9ZExO50/QfAKRHxxJxx48A4wOjo6OqJiYlCYWu1GiMjI8zs2V/odv20avmSpvNnsw1Kt9s8uhgeO9Dbfbfa5l4Nep91y7mKK2u2suaC7rONjY1NR0Sl2bKFnawgIn4FvFLSUcBXJZ0QEduKBomITcAmgEqlEtVqtdDtJycnqVarrNtwS9G77ptd51ebzp/NNijdbvP6VQe5YqajH3NLrba5V4PeZ91yruLKmq2suWAw2Qq9yiUifgLcAayZs2gPsAJA0kJgCfBkH/KZmVmHOnmVy4vSkTmSFgOvAx6cM2wzcEGaPhf4ZnRyLsfMzPqmk7/FlwHXpPPozwNuiIgtki4DpiJiM3AVcK2kncA+YO3AEpuZWVNtCz0i7gdObDL/0obpnwNv7G80MzMrwu8UNTPLhAvdzCwTLnQzs0y40M3MMuFCNzPLhAvdzCwTLnQzs0y40M3MMuFCNzPLhAvdzCwTLnQzs0y40M3MMuFCNzPLhAvdzCwTLnQzs0y40M3MMuFCNzPLRCffKbpC0h2SHpC0XdIlTcZUJe2XtDVdLm22LjMzG5xOvlP0ILA+Iu6VdCQwLem2iHhgzri7IuKs/kc0M7NOtD1Cj4hHI+LeNP1TYAewfNDBzMysGEVE54OllcCdwAkR8XTD/CpwE7Ab+BHw3ojY3uT248A4wOjo6OqJiYlCYWu1GiMjI8zs2V/odv20avmSpvNnsw1Kt9s8uhgeO9Dbfbfa5l4Nep91y7mKK2u2suaC7rONjY1NR0Sl2bKOC13SCPCfwN9HxM1zlv0O8OuIqEk6A/hURBx/qPVVKpWYmprq6L5nTU5OUq1WWbnhlkK366ddl5/ZdP5stkHpdpvXrzrIFTOdnFlrrdU292rQ+6xbzlVcWbOVNRd0n01Sy0Lv6FUukhZRPwL/8twyB4iIpyOilqZvBRZJWlo4qZmZda2TV7kIuArYEREfbzHmxWkckk5O632yn0HNzOzQOvlb/DTgTcCMpK1p3vuBlwBExJXAucDbJR0EDgBro8jJeTMz61nbQo+IbwFqM+YzwGf6FcrMzIrzO0XNzDLhQjczy4QL3cwsEy50M7NMuNDNzDLhQjczy4QL3cwsEy50M7NMuNDNzDLhQjczy4QL3cwsEy50M7NMuNDNzDLhQjczy4QL3cwsEy50M7NMuNDNzDLRyXeKrpB0h6QHJG2XdEmTMZL0aUk7Jd0v6aTBxDUzs1Y6+U7Rg8D6iLhX0pHAtKTbIuKBhjFvAI5Pl1OAz6V/zcxsnrQ9Qo+IRyPi3jT9U2AHsHzOsHOAL0Xdt4GjJC3re1ozM2tJEdH5YGklcCdwQkQ83TB/C3B5+kJpJN0OvC8ipubcfhwYBxgdHV09MTFRKGytVmNkZISZPfsL3a6fVi1f0nT+bLZB6XabRxfDYwd6u+9W29yrQe+zbjlXcWXNVtZc0H22sbGx6YioNFvWySkXACSNADcB724s8yIiYhOwCaBSqUS1Wi10+8nJSarVKus23NLN3ffFrvOrTefPZhuUbrd5/aqDXDHT8Y+5qVbb3KtB77NuOVdxZc1W1lwwmGwdvcpF0iLqZf7liLi5yZA9wIqG68emeWZmNk86eZWLgKuAHRHx8RbDNgNvTq92ORXYHxGP9jGnmZm10cnf4qcBbwJmJG1N894PvAQgIq4EbgXOAHYCPwMu7HtSMzM7pLaFnp7oVJsxAVzUr1BmZlac3ylqZpYJF7qZWSZc6GZmmXChm5llwoVuZpYJF7qZWSZc6GZmmXChm5llwoVuZpYJF7qZWSZc6GZmmXChm5llwoVuZpYJF7qZWSZc6GZmmXChm5llwoVuZpaJTr5T9GpJeyVta7G8Kmm/pK3pcmn/Y5qZWTudfKfoF4HPAF86xJi7IuKsviQyM7OutD1Cj4g7gX3zkMXMzHqg+vc7txkkrQS2RMQJTZZVgZuA3cCPgPdGxPYW6xkHxgFGR0dXT0xMFApbq9UYGRlhZs/+Qrfrp1XLlzSdP5ttULrd5tHF8NiB3u671Tb3atD7rFvOVVxZs5U1F3SfbWxsbDoiKs2W9aPQfwf4dUTUJJ0BfCoijm+3zkqlElNTU23vu9Hk5CTVapWVG24pdLt+2nX5mU3nz2YblG63ef2qg1wx08mZtdZabXOvBr3PuuVcxZU1W1lzQffZJLUs9J5f5RIRT0dELU3fCiyStLTX9ZqZWTE9F7qkF0tSmj45rfPJXtdrZmbFtP1bXNJ1QBVYKmk38EFgEUBEXAmcC7xd0kHgALA2OjmPY2ZmfdW20CPivDbLP0P9ZY1mZjZEfqeomVkmXOhmZplwoZuZZcKFbmaWCRe6mVkmXOhmZplwoZuZZcKFbmaWCRe6mVkmXOhmZplwoZuZZcKFbmaWCRe6mVkmXOhmZplwoZuZZcKFbmaWCRe6mVkm2ha6pKsl7ZW0rcVySfq0pJ2S7pd0Uv9jmplZO50coX8RWHOI5W8Ajk+XceBzvccyM7Oi2hZ6RNwJ7DvEkHOAL0Xdt4GjJC3rV0AzM+uMIqL9IGklsCUiTmiybAtweUR8K12/HXhfREw1GTtO/Sie0dHR1RMTE4XC1mo1RkZGmNmzv9Dt+mnV8iVN589mG5Rut3l0MTx2oLf7brXNvRr0PuuWcxVX1myHyjXsHul2n42NjU1HRKXZsoU9JysgIjYBmwAqlUpUq9VCt5+cnKRarbJuwy0DSNeZXedXm86fzTYo3W7z+lUHuWKmtx9zq23u1aD3Wbecq7iyZjtUrmH3yCD2WT9e5bIHWNFw/dg0z8zM5lE/Cn0z8Ob0apdTgf0R8Wgf1mtmZgW0/Vtc0nVAFVgqaTfwQWARQERcCdwKnAHsBH4GXDiosGZm1lrbQo+I89osD+CiviUyM7Ou+J2iZmaZcKGbmWXChW5mlgkXuplZJlzoZmaZcKGbmWXChW5mlgkXuplZJlzoZmaZcKGbmWXChW5mlgkXuplZJlzoZmaZcKGbmWXChW5mlgkXuplZJlzoZmaZ6KjQJa2R9JCknZI2NFm+TtLjkramy1v7H9XMzA6lk+8UXQB8FngdsBv4jqTNEfHAnKHXR8TFA8hoZmYd6OQI/WRgZ0Q8HBG/ACaAcwYby8zMilL9O54PMUA6F1gTEW9N198EnNJ4NC5pHfBh4HHge8B7IuKHTdY1DowDjI6Orp6YmCgUtlarMTIywsye/YVu10+rli9pOn8226B0u82ji+GxA73dd6tt7tWg91m3nKu4smY7VK5h90i3+2xsbGw6IirNlrU95dKhfwOui4hnJb0NuAY4fe6giNgEbAKoVCpRrVYL3cnk5CTVapV1G27pPXGXdp1fbTp/NtugdLvN61cd5IqZ3n7Mrba5V4PeZ91yruLKmu1QuYbdI4PYZ52cctkDrGi4fmya9xsR8WREPJuufgFY3Z94ZmbWqU4K/TvA8ZJeKun5wFpgc+MAScsarp4N7OhfRDMz60Tbv8Uj4qCki4GvAwuAqyNiu6TLgKmI2Ay8S9LZwEFgH7BugJnNzKyJjk6uRsStwK1z5l3aML0R2NjfaGZmVoTfKWpmlgkXuplZJlzoZmaZcKGbmWXChW5mlgkXuplZJlzoZmaZcKGbmWXChW5mlgkXuplZJlzoZmaZcKGbmWXChW5mlgkXuplZJlzoZmaZcKGbmWXChW5mlomOCl3SGkkPSdopaUOT5YdJuj4tv1vSyr4nNTOzQ2pb6JIWAJ8F3gC8AjhP0ivmDHsL8FREHAd8AvhIv4OamdmhdXKEfjKwMyIejohfABPAOXPGnANck6ZvBF4rSf2LaWZm7XTyJdHLgR82XN8NnNJqTEQclLQfeCHwROMgSePAeLpak/RQwbxL565zvqn13x5Dz9bMu/qQ6xDb3KtS7jOcqxtlzVbKXOn/VLfZfr/Vgk4KvW8iYhOwqdvbS5qKiEofI/VNWbOVNReUN5tzFVfWbGXNBYPJ1skplz3Aiobrx6Z5TcdIWggsAZ7sR0AzM+tMJ4X+HeB4SS+V9HxgLbB5zpjNwAVp+lzgmxER/YtpZmbttD3lks6JXwx8HVgAXB0R2yVdBkxFxGbgKuBaSTuBfdRLfxC6Pl0zD8qaray5oLzZnKu4smYray4YQDb5QNrMLA9+p6iZWSZc6GZmmRh6oUu6WtJeSdsa5h0j6TZJ30//Hp3mS9Kn00cM3C/ppIbbXJDGf1/SBc3uq2CuFZLukPSApO2SLilDNkmHS7pH0n0p14fS/Jemj13YmT6G4flpfsuPZZC0Mc1/SNLre8k1J+MCSd+VtKUs2STtkjQjaaukqTRv6I+ztM6jJN0o6UFJOyS9atjZJL087avZy9OS3j3sXGl970mP/W2Srkv/J4b+GEvrvCTl2i7p3Wne/O2ziBjqBXgNcBKwrWHeR4ENaXoD8JE0fQbwNUDAqcDdaf4xwMPp36PT9NE95loGnJSmjwS+R/2jD4aaLa1/JE0vAu5O93cDsDbNvxJ4e5p+B3Blml4LXJ+mXwHcBxwGvBT4AbCgTz/Tvwb+GdiSrg89G7ALWDpn3tAfZ2m91wBvTdPPB44qS7a07gXAj6m/oWXYj//lwCPA4obH1rqSPMZOALYBL6D+gpP/AI6bz33W8w+7Tw+YlTy30B8ClqXpZcBDafrzwHlzxwHnAZ9vmP+ccX3K+K/A68qULT1w7qX+zt0ngIVp/quAr6fprwOvStML0zgBG4GNDev6zbgeMx0L3A6cDmxJ9zX0bDQv9KH/LKm/Z+MR0gsUypStYV1/BvxXGXLx23elH5MeM1uA15fkMfZG4KqG638L/M187rOhn3JpYTQiHk3TPwZG03SzjyFYfoj5fZH+TDuR+tHw0LOlUxpbgb3AbdSPLn4SEQeb3MdzPpYBmP1YhkHts09SfxD/Ol1/YUmyBfANSdOqfwQFlOBnSf3o8HHgH1U/TfUFSUeUJNustcB1aXqouSJiD/Ax4H+AR6k/ZqYpx2NsG/Ankl4o6QXUj8BXMI/7rKyF/htR/xU1tNdWShoBbgLeHRFPNy4bVraI+FVEvJL60fDJwB/Md4ZmJJ0F7I2I6WFnaeLVEXES9U8NvUjSaxoXDvFxtpD6KcfPRcSJwDPU/ywvQzbSueizga/MXTaMXOn88znUfxH+HnAEsGY+M7QSETuof9LsN4B/B7YCv5ozZqD7rKyF/pikZQDp371pfquPIejk4wkKk7SIepl/OSJuLlM2gIj4CXAH9T8xj1L9Yxfm3kerj2UYRK7TgLMl7aL+qZynA58qQ7Z0ZEdE7AW+Sv0XYRl+lruB3RFxd7p+I/WCL0M2qP8CvDciHkvXh53rT4FHIuLxiPglcDP1x93QH2MAEXFVRKyOiNcAT1F/7m3+9lk/zrH1euH/nkP/B577JMJH0/SZPPdJhHvS/GOon4c8Ol0eAY7pMZOALwGfnDN/qNmAFwFHpenFwF3AWdSPoBqfFHpHmr6I5z4pdEOa/kOe+6TQw/TpSdG0/iq/fVJ0qNmoH8Ud2TD939SP6ob+OEvrvQt4eZr+u5SrLNkmgAtL9Pg/BdhO/fkjUX9C+Z3Dfow15Pvd9O9LgAepP8E9b/usL/95e9wB11E/F/ZL6kcrb6F+jut24PvUnyk+Jo0V9S/b+AEwA1Qa1vNXwM50ubAPuV5N/U+j+6n/6bSV+jmxoWYD/gj4bsq1Dbg0zX8ZcE+6j68Ah6X5h6frO9PylzWs6wMp70PAG/r8c63y20IfarZ0//ely3bgA2n+0B9naZ2vBKbSz/Rf0n/ioWej/svvSWBJw7wy5PoQ9bLcBlxLvZRL8fin/sv5gfRYe+187zO/9d/MLBNlPYduZmYFudDNzDLhQjczy4QL3cwsEy50M7NMuNDNzDLhQjczy8T/AponGfa1VcRIAAAAAElFTkSuQmCC\n", "text/plain": [ "
" ] }, "metadata": { "needs_background": "light" }, "output_type": "display_data" } ], "source": [ "repeated_names[repeated_names.issn_count > 1000].issn_count.hist(bins=10)" ] }, { "cell_type": "code", "execution_count": 22, "metadata": {}, "outputs": [ { "data": { "text/html": [ "
\n", "\n", "\n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", "
nameissn_count
3499Bulletin.2752
7632Newsletter.2715
8317Rapport.1050
23662Proceedings.1403
45839Annual report /1090
45907Annual report.8980
45964Annuaire.1260
47217Rapport annuel.2656
\n", "
" ], "text/plain": [ " name issn_count\n", "3499 Bulletin. 2752\n", "7632 Newsletter. 2715\n", "8317 Rapport. 1050\n", "23662 Proceedings. 1403\n", "45839 Annual report / 1090\n", "45907 Annual report. 8980\n", "45964 Annuaire. 1260\n", "47217 Rapport annuel. 2656" ] }, "execution_count": 22, "metadata": {}, "output_type": "execute_result" } ], "source": [ "repeated_names[repeated_names.issn_count > 1000]" ] }, { "cell_type": "code", "execution_count": 23, "metadata": {}, "outputs": [ { "data": { "text/html": [ "
\n", "\n", "\n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", "
nameissn_count
102Bulletin d'information.693
3218Bulletin de liaison.510
3499Bulletin.2752
7632Newsletter.2715
8317Rapport.1050
23662Proceedings.1403
45794Report.743
45839Annual report /1090
45907Annual report.8980
45964Annuaire.1260
46370Jaarverslag.675
47142Rapport d'activité.660
47217Rapport annuel.2656
49289Jahresbericht.518
57558Annual report760
121599Alumni directory /511
128827Bulletin municipal.521
150529˜La œLettre.623
168933Local climatological data.613
269004Estimates.535
\n", "
" ], "text/plain": [ " name issn_count\n", "102 Bulletin d'information. 693\n", "3218 Bulletin de liaison. 510\n", "3499 Bulletin. 2752\n", "7632 Newsletter. 2715\n", "8317 Rapport. 1050\n", "23662 Proceedings. 1403\n", "45794 Report. 743\n", "45839 Annual report / 1090\n", "45907 Annual report. 8980\n", "45964 Annuaire. 1260\n", "46370 Jaarverslag. 675\n", "47142 Rapport d'activité. 660\n", "47217 Rapport annuel. 2656\n", "49289 Jahresbericht. 518\n", "57558 Annual report 760\n", "121599 Alumni directory / 511\n", "128827 Bulletin municipal. 521\n", "150529 ˜La œLettre. 623\n", "168933 Local climatological data. 613\n", "269004 Estimates. 535" ] }, "execution_count": 23, "metadata": {}, "output_type": "execute_result" } ], "source": [ "repeated_names[repeated_names.issn_count > 500]" ] }, { "cell_type": "code", "execution_count": 24, "metadata": {}, "outputs": [ { "data": { "text/html": [ "
\n", "\n", "\n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", "
nameissn_count
102Bulletin d'information.693
2665Newsletter /259
3218Bulletin de liaison.510
3499Bulletin.2752
3926Boletín.216
.........
425644Rapport d'activité ...394
532500Relatório e contas.247
603144Bildung und Beruf regional.292
1006131Vies de famille.222
1110247Country risk service.271
\n", "

66 rows × 2 columns

\n", "
" ], "text/plain": [ " name issn_count\n", "102 Bulletin d'information. 693\n", "2665 Newsletter / 259\n", "3218 Bulletin de liaison. 510\n", "3499 Bulletin. 2752\n", "3926 Boletín. 216\n", "... ... ...\n", "425644 Rapport d'activité ... 394\n", "532500 Relatório e contas. 247\n", "603144 Bildung und Beruf regional. 292\n", "1006131 Vies de famille. 222\n", "1110247 Country risk service. 271\n", "\n", "[66 rows x 2 columns]" ] }, "execution_count": 24, "metadata": {}, "output_type": "execute_result" } ], "source": [ "repeated_names[repeated_names.issn_count > 200]" ] }, { "cell_type": "code", "execution_count": 25, "metadata": {}, "outputs": [ { "data": { "text/html": [ "
\n", "\n", "\n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", "
nameissn_count
102Bulletin d'information.693
2665Newsletter /259
3218Bulletin de liaison.510
3499Bulletin.2752
3926Boletín.216
.........
1306798Country commerce.120
1318569Bible studies for life.159
1796742LexisNexis practice guide.101
2628387Operational risk report.119
2650557Interempresas net.108
\n", "

191 rows × 2 columns

\n", "
" ], "text/plain": [ " name issn_count\n", "102 Bulletin d'information. 693\n", "2665 Newsletter / 259\n", "3218 Bulletin de liaison. 510\n", "3499 Bulletin. 2752\n", "3926 Boletín. 216\n", "... ... ...\n", "1306798 Country commerce. 120\n", "1318569 Bible studies for life. 159\n", "1796742 LexisNexis practice guide. 101\n", "2628387 Operational risk report. 119\n", "2650557 Interempresas net. 108\n", "\n", "[191 rows x 2 columns]" ] }, "execution_count": 25, "metadata": {}, "output_type": "execute_result" } ], "source": [ "repeated_names[repeated_names.issn_count > 100]" ] }, { "cell_type": "code", "execution_count": 26, "metadata": {}, "outputs": [ { "data": { "text/html": [ "
\n", "\n", "\n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", "
nameissn_count
5Activitas Nervosa Superior.2
11Library journal.2
23Acta cardiologica.2
26Actualidad económica.3
31Acta Ornithologica.3
.........
2929626Modern machine shop México.2
2929635Lecture notes in control and information scien...2
2929646Critical Studies in Dance Leadership and Inclu...2
2929691Nigerian Journal of Wildlife Management2
2929702Verzeichniss der Kunstwerke lebender Künstler,...2
\n", "

194241 rows × 2 columns

\n", "
" ], "text/plain": [ " name issn_count\n", "5 Activitas Nervosa Superior. 2\n", "11 Library journal. 2\n", "23 Acta cardiologica. 2\n", "26 Actualidad económica. 3\n", "31 Acta Ornithologica. 3\n", "... ... ...\n", "2929626 Modern machine shop México. 2\n", "2929635 Lecture notes in control and information scien... 2\n", "2929646 Critical Studies in Dance Leadership and Inclu... 2\n", "2929691 Nigerian Journal of Wildlife Management 2\n", "2929702 Verzeichniss der Kunstwerke lebender Künstler,... 2\n", "\n", "[194241 rows x 2 columns]" ] }, "execution_count": 26, "metadata": {}, "output_type": "execute_result" } ], "source": [ "repeated_names" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "If a name matches a repeated name exactly or fuzzy matches to a repeated name and there is not other information available, the match status must be ambigious." ] }, { "cell_type": "code", "execution_count": 28, "metadata": {}, "outputs": [ { "data": { "text/plain": [ "['2735-928X', '2735-9298']" ] }, "execution_count": 28, "metadata": {}, "output_type": "execute_result" } ], "source": [ "mapping[\"Nigerian Journal of Wildlife Management\"]" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "These are two ISSN refering to the same journal." ] }, { "cell_type": "code", "execution_count": 29, "metadata": {}, "outputs": [], "source": [ "import requests" ] }, { "cell_type": "code", "execution_count": 30, "metadata": {}, "outputs": [ { "data": { "text/plain": [ "{'@graph': [{'@id': 'http://id.loc.gov/vocabulary/countries/nr',\n", " 'label': 'Nigeria'},\n", " {'@id': 'organization/ISSNCenter#_e',\n", " '@type': 'http://schema.org/Organization'},\n", " {'@id': 'resource/ISSN-L/2735-9298',\n", " 'identifiedBy': 'resource/ISSN/2735-928X#ISSN-L'},\n", " {'@id': 'resource/ISSN/2735-928X',\n", " '@type': ['http://id.loc.gov/ontologies/bibframe/Work',\n", " 'http://id.loc.gov/ontologies/bibframe/Instance',\n", " 'http://schema.org/Periodical'],\n", " 'identifiedBy': ['resource/ISSN/2735-928X#ISSN-L',\n", " 'resource/ISSN/2735-928X#ISSN',\n", " 'resource/ISSN/2735-928X#KeyTitle'],\n", " 'mainTitle': 'Nigerian Journal of Wildlife Management',\n", " 'otherPhysicalFormat': 'resource/ISSN/2735-9298',\n", " 'title': 'resource/ISSN/2735-928X#KeyTitle',\n", " 'format': 'vocabularies/medium#Online',\n", " 'identifier': '2735-928X',\n", " 'isFormatOf': 'resource/ISSN/2735-9298',\n", " 'type': 'http://marc21rdf.info/terms/formofmaterial#a',\n", " 'http://purl.org/ontology/bibo/issn': '2735-928X',\n", " 'isPartOf': 'resource/ISSN-L/2735-9298',\n", " 'issn': '2735-928X',\n", " 'name': ['Nigerian JOurnal of Wildlife Management (Ondo. Online)',\n", " 'Nigerian Journal of Wildlife Management'],\n", " 'publication': 'resource/ISSN/2735-928X#ReferencePublicationEvent',\n", " 'url': 'http://www.wildlifesociety.ng.org/'},\n", " {'@id': 'resource/ISSN/2735-928X#ISSN',\n", " '@type': 'http://id.loc.gov/ontologies/bibframe/Issn',\n", " 'status': 'vocabularies/IdentifierStatus#Valid',\n", " 'value': '2735-928X'},\n", " {'@id': 'resource/ISSN/2735-928X#ISSN-L',\n", " '@type': 'http://id.loc.gov/ontologies/bibframe/IssnL',\n", " 'status': 'vocabularies/IdentifierStatus#Valid',\n", " 'value': '2735-9298'},\n", " {'@id': 'resource/ISSN/2735-928X#KeyTitle',\n", " '@type': ['http://id.loc.gov/ontologies/bibframe/Identifier',\n", " 'http://id.loc.gov/ontologies/bibframe/KeyTitle'],\n", " 'value': 'Nigerian JOurnal of Wildlife Management (Ondo. Online)'},\n", " {'@id': 'resource/ISSN/2735-928X#Record',\n", " '@type': 'http://schema.org/CreativeWork',\n", " 'status': 'vocabularies/RecordStatus#Register',\n", " 'modified': '20200808163600.0',\n", " 'mainEntity': 'resource/ISSN/2735-928X',\n", " 'wasAttributedTo': 'organization/ISSNCenter#_e'},\n", " {'@id': 'resource/ISSN/2735-928X#ReferencePublicationEvent',\n", " '@type': 'http://schema.org/PublicationEvent',\n", " 'location': 'http://id.loc.gov/vocabulary/countries/nr'}],\n", " '@context': {'status': {'@id': 'http://id.loc.gov/ontologies/bibframe/status',\n", " '@type': '@id'},\n", " 'value': {'@id': 'http://www.w3.org/1999/02/22-rdf-syntax-ns#value'},\n", " 'publication': {'@id': 'http://schema.org/publication', '@type': '@id'},\n", " 'mainTitle': {'@id': 'http://id.loc.gov/ontologies/bibframe/mainTitle'},\n", " 'title': {'@id': 'http://id.loc.gov/ontologies/bibframe/title',\n", " '@type': '@id'},\n", " 'name': {'@id': 'http://schema.org/name'},\n", " 'issn': {'@id': 'http://schema.org/issn'},\n", " 'format': {'@id': 'http://purl.org/dc/elements/1.1/format', '@type': '@id'},\n", " 'url': {'@id': 'http://schema.org/url'},\n", " 'identifiedBy': {'@id': 'http://id.loc.gov/ontologies/bibframe/identifiedBy',\n", " '@type': '@id'},\n", " 'otherPhysicalFormat': {'@id': 'http://id.loc.gov/ontologies/bibframe/otherPhysicalFormat',\n", " '@type': '@id'},\n", " 'isPartOf': {'@id': 'http://schema.org/isPartOf', '@type': '@id'},\n", " 'type': {'@id': 'http://purl.org/dc/terms/type', '@type': '@id'},\n", " 'identifier': {'@id': 'http://purl.org/dc/elements/1.1/identifier'},\n", " 'isFormatOf': {'@id': 'http://purl.org/dc/terms/isFormatOf', '@type': '@id'},\n", " 'wasAttributedTo': {'@id': 'http://www.w3.org/ns/prov#wasAttributedTo',\n", " '@type': '@id'},\n", " 'mainEntity': {'@id': 'http://schema.org/mainEntity', '@type': '@id'},\n", " 'modified': {'@id': 'http://purl.org/dc/terms/modified',\n", " '@type': 'http://www.w3.org/2001/XMLSchema#dateTime'},\n", " 'location': {'@id': 'http://schema.org/location', '@type': '@id'},\n", " 'label': {'@id': 'http://www.w3.org/2000/01/rdf-schema#label'}}}" ] }, "execution_count": 30, "metadata": {}, "output_type": "execute_result" } ], "source": [ "requests.get(\"https://portal.issn.org/resource/ISSN/2735-928X?format=json\").json()" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "Interestingly, most (93%) journal do not seem to have the distinction between electronic and print. " ] }, { "cell_type": "code", "execution_count": 32, "metadata": {}, "outputs": [ { "data": { "text/plain": [ "0.9336999659012597" ] }, "execution_count": 32, "metadata": {}, "output_type": "execute_result" } ], "source": [ "len(unique_name) / len(df)" ] }, { "cell_type": "code", "execution_count": 35, "metadata": {}, "outputs": [ { "data": { "text/html": [ "
\n", "\n", "\n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", "
nameissn_count
702322International accounting and auditing standards1
39992Varaždinske vijesti (Tisak)1
1722785Nihon Shuwa Gakkai Taikai yokoushuu1
2598901Solidarité laïque1
2408544AUP!1
1785394˜The œASHRAE handbook CD. HVAC applications1
325078Kampō iyaku1
49595Fieldiana. Anthropology1
110831Vyčislitelʹnaâ matematika i matematičeskaâ fiz...1
2880661The Assembly Reporters Africa ( Abuja)1
\n", "
" ], "text/plain": [ " name issn_count\n", "702322 International accounting and auditing standards 1\n", "39992 Varaždinske vijesti (Tisak) 1\n", "1722785 Nihon Shuwa Gakkai Taikai yokoushuu 1\n", "2598901 Solidarité laïque 1\n", "2408544 AUP! 1\n", "1785394 ˜The œASHRAE handbook CD. HVAC applications 1\n", "325078 Kampō iyaku 1\n", "49595 Fieldiana. Anthropology 1\n", "110831 Vyčislitelʹnaâ matematika i matematičeskaâ fiz... 1\n", "2880661 The Assembly Reporters Africa ( Abuja) 1" ] }, "execution_count": 35, "metadata": {}, "output_type": "execute_result" } ], "source": [ "unique_name.sample(n=10)" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## Example \"Fieldiana\"\n", "\n", "* \"Fieldiana. Anthropology\"\n", "* https://www.jstor.org/journal/fieldianaanthro" ] }, { "cell_type": "code", "execution_count": 36, "metadata": {}, "outputs": [ { "data": { "text/plain": [ "['0071-4739']" ] }, "execution_count": 36, "metadata": {}, "output_type": "execute_result" } ], "source": [ "mapping[\"Fieldiana. Anthropology\"]" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "However, jstor reports two ISSN: INSSP: 0071-4739, EISSN: 2162-4321 - but ISSN.org does not know about it?" ] }, { "cell_type": "code", "execution_count": 47, "metadata": {}, "outputs": [], "source": [ "unique_issn = set([item for v in mapping.values() for item in v])" ] }, { "cell_type": "code", "execution_count": 48, "metadata": {}, "outputs": [ { "data": { "text/plain": [ "True" ] }, "execution_count": 48, "metadata": {}, "output_type": "execute_result" } ], "source": [ "\"2162-4321\" in unique_issn" ] }, { "cell_type": "code", "execution_count": 51, "metadata": {}, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "Fieldiana. ['2163-7105', '2162-4321', '0096-0438', '0015-0746', '0096-2651', '0097-3572', '2162-4267', '0015-0754', '0071-4739', '2162-4348', '2162-4291', '2158-5520']\n", "Fieldiana. Anthropology (Online) ['2162-4321']\n" ] } ], "source": [ "for k, v in mapping.items():\n", " if \"2162-4321\" in v:\n", " print(k, v)" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "As an example: when given a string like \"Fieldiana\" we would return ambiguous. But \"Fieldiana. (Online)\" might be matched to '2162-4321'. " ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## Other Examples" ] }, { "cell_type": "code", "execution_count": 53, "metadata": {}, "outputs": [ { "data": { "text/html": [ "
\n", "\n", "\n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", "
nameissn_count
2190239Mediazine.4
396642San Bernardino County popular street atlas3
510019˜L'œesquisse.2
1517119Developing mental health.2
1853045Jan Dix2
1021374Rennes 1 campus.2
746313Russian journal of coordination chemistry.2
1087451Pomorac.2
109465Bloknot agitatora.36
1154969Estonia :2
\n", "
" ], "text/plain": [ " name issn_count\n", "2190239 Mediazine. 4\n", "396642 San Bernardino County popular street atlas 3\n", "510019 ˜L'œesquisse. 2\n", "1517119 Developing mental health. 2\n", "1853045 Jan Dix 2\n", "1021374 Rennes 1 campus. 2\n", "746313 Russian journal of coordination chemistry. 2\n", "1087451 Pomorac. 2\n", "109465 Bloknot agitatora. 36\n", "1154969 Estonia : 2" ] }, "execution_count": 53, "metadata": {}, "output_type": "execute_result" } ], "source": [ "repeated_names.sample(n=10)" ] }, { "cell_type": "code", "execution_count": 54, "metadata": {}, "outputs": [ { "data": { "text/plain": [ "['1608-3318', '1070-3284']" ] }, "execution_count": 54, "metadata": {}, "output_type": "execute_result" } ], "source": [ "mapping[\"Russian journal of coordination chemistry.\"]" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "A new question: How many journals are listed under different names, yet still refer to the same journal?" ] }, { "cell_type": "code", "execution_count": 57, "metadata": {}, "outputs": [ { "data": { "text/plain": [ "['0733-7167', '0733-7159', '0733-2289']" ] }, "execution_count": 57, "metadata": {}, "output_type": "execute_result" } ], "source": [ "mapping[\"San Bernardino County popular street atlas\"]" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## Journal of ..." ] }, { "cell_type": "code", "execution_count": 61, "metadata": {}, "outputs": [ { "data": { "text/html": [ "
\n", "\n", "\n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", "
nameissn_count
977Journal of vocational behavior.1
978Journal of vocational behavior (Print)1
1206Journal of American Concrete Institute.1
1207Journal of the American Concrete Institute1
1310Journal of the American Dental Hygienists' Ass...1
.........
2929340Journal of surgical procedures and case reports1
2929448Journal of Epidermiological Society of Nigeria...1
2929449Journal of Epidermiological Society of Nigeria1
2929663Journal of Practicing Teachers1
2929664Journal of Practicing Teachers (Uyo. Online)1
\n", "

40190 rows × 2 columns

\n", "
" ], "text/plain": [ " name issn_count\n", "977 Journal of vocational behavior. 1\n", "978 Journal of vocational behavior (Print) 1\n", "1206 Journal of American Concrete Institute. 1\n", "1207 Journal of the American Concrete Institute 1\n", "1310 Journal of the American Dental Hygienists' Ass... 1\n", "... ... ...\n", "2929340 Journal of surgical procedures and case reports 1\n", "2929448 Journal of Epidermiological Society of Nigeria... 1\n", "2929449 Journal of Epidermiological Society of Nigeria 1\n", "2929663 Journal of Practicing Teachers 1\n", "2929664 Journal of Practicing Teachers (Uyo. Online) 1\n", "\n", "[40190 rows x 2 columns]" ] }, "execution_count": 61, "metadata": {}, "output_type": "execute_result" } ], "source": [ "df[df.name.str.startswith(\"Journal of\")]" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [] } ], "metadata": { "kernelspec": { "display_name": "Python 3", "language": "python", "name": "python3" }, "language_info": { "codemirror_mode": { "name": "ipython", "version": 3 }, "file_extension": ".py", "mimetype": "text/x-python", "name": "python", "nbconvert_exporter": "python", "pygments_lexer": "ipython3", "version": "3.7.8" } }, "nbformat": 4, "nbformat_minor": 4 }