User:TJones (WMF)/Notes/Language Detection Evaluation/Results by Language Count

Languages reported by count
Actual: English (599)	Spanish (43)	Chinese (20)	Portuguese (19)	Arabic (10) French (10)	Tagalog (9)	German (8)	Malay (6)	Russian (5) Turkish (5)	Indonesian (4)	Persian (4)	Swahili (4)	Korean (3) Bengali (2)	Bulgarian (2)	Hindi (2)	Italian (2)	Norwegian (2) Croatian (1)	Dutch (1)	Estonian (1)	Finnish (1)	Greek (1) Hmong (1)	Japanese (1)	Kannada (1)	Latin (1)	Polish (1) Serbian (1)	Somali (1)	Swedish (1)	Tamil (1)	Thai (1) Uzbek (1) 1: English (267)	Romanian (71)	Italian (55)	Spanish (55)	French (54) Tagalog (41)	German (33)	Portuguese (27)	Indonesian (24)	Chinese (20) Dutch (14)	Albanian (14)	Norwegian (14)	Arabic (9)	Danish (8) Finnish (8)	Swedish (7)	Estonian (7)	Turkish (6)	Lithuanian (6) Polish (5)	Persian (4)	Croatian (3)	Russian (3)	Korean (3) Hindi (2)	Bengali (2)	Hungarian (2)	Macedonian (2)	Bulgarian (2) Czech (2)	Ukrainian (1)	Japanese (1)	Tamil (1)	Greek (1) Thai (1) 2: English (320)	Romanian (92)	Italian (84)	French (78)	Spanish (73) Tagalog (56)	German (45)	Indonesian (32)	Portuguese (30)	Albanian (21) Chinese (20)	Dutch (20)	Norwegian (18)	Estonian (16)	Danish (16) Swedish (14)	Finnish (11)	Lithuanian (10)	Croatian (10)	Arabic (9) Turkish (8)	Czech (6)	Polish (6)	Persian (5)	Macedonian (3) Korean (3)	Russian (3)	Bulgarian (2)	Hungarian (2)	Bengali (2) Hindi (2)	Thai (1)	Greek (1)	Latvian (1)	Tamil (1) Japanese (1)	Ukrainian (1) 3: English (325)	Romanian (96)	Italian (93)	French (82)	Spanish (78) Tagalog (59)	German (47)	Indonesian (35)	Portuguese (31)	Albanian (21) Dutch (21)	Chinese (20)	Danish (19)	Norwegian (18)	Estonian (18) Swedish (16)	Lithuanian (11)	Finnish (11)	Croatian (11)	Arabic (9) Turkish (8)	Czech (6)	Polish (6)	Persian (5)	Hungarian (4) Macedonian (3)	Russian (3)	Korean (3)	Bengali (2)	Latvian (2) Bulgarian (2)	Hindi (2)	Tamil (1)	Greek (1)	Thai (1) Ukrainian (1)	Japanese (1) 4: English (325)	Romanian (98)	Italian (93)	French (82)	Spanish (78) Tagalog (59)	German (47)	Indonesian (35)	Portuguese (31)	Albanian (21) Dutch (21)	Chinese (20)	Danish (19)	Norwegian (18)	Estonian (18) Swedish (16)	Croatian (12)	Lithuanian (11)	Finnish (11)	Arabic (9) Turkish (8)	Polish (7)	Czech (6)	Persian (5)	Hungarian (4) Macedonian (3)	Latvian (3)	Korean (3)	Russian (3)	Bulgarian (2) Bengali (2)	Hindi (2)	Thai (1)	Tamil (1)	Greek (1) Ukrainian (1)	Japanese (1)

Recall and precision by number of languages considered
thresh	f0.5	f1	f2	recall	prec	total	hits	misses TOTAL (775) 1	 49.3%	 49.3%	 49.3%	 49.3%	 49.3%	775	382	393 2	 45.4%	 49.2%	 53.6%	 57.0%	 43.2%	775	442	581 3	 44.2%	 48.5%	 53.7%	 57.8%	 41.8%	775	448	624 4	 44.1%	 48.4%	 53.6%	 57.8%	 41.6%	775	448	629 English (599) 1	 79.2%	 61.0%	 49.6%	 44.1%	 98.9%	599	264	3 2	 84.4%	 69.0%	 58.4%	 52.9%	 99.1%	599	317	3 3	 84.8%	 69.7%	 59.2%	 53.8%	 99.1%	599	322	3 4	 84.8%	 69.7%	 59.2%	 53.8%	 99.1%	599	322	3 Spanish (43) 1	 55.1%	 59.2%	 63.9%	 67.4%	 52.7%	43	29	26 2	 47.8%	 55.2%	 65.3%	 74.4%	 43.8%	43	32	41 3	 45.1%	 52.9%	 64.0%	 74.4%	 41.0%	43	32	46 4	 45.1%	 52.9%	 64.0%	 74.4%	 41.0%	43	32	46 Chinese (20) 1	100.0%	100.0%	100.0%	100.0%	100.0%	20	20	0 2	100.0%	100.0%	100.0%	100.0%	100.0%	20	20	0 3	100.0%	100.0%	100.0%	100.0%	100.0%	20	20	0 4	100.0%	100.0%	100.0%	100.0%	100.0%	20	20	0 Portuguese (19) 1	 47.2%	 52.2%	 58.3%	 63.2%	 44.4%	19	12	15 2	 46.8%	 53.1%	 61.3%	 68.4%	 43.3%	19	13	17 3	 49.0%	 56.0%	 65.4%	 73.7%	 45.2%	19	14	17 4	 49.0%	 56.0%	 65.4%	 73.7%	 45.2%	19	14	17 Arabic (10) 1	 87.0%	 84.2%	 81.6%	 80.0%	 88.9%	10	8	1 2	 87.0%	 84.2%	 81.6%	 80.0%	 88.9%	10	8	1 3	 87.0%	 84.2%	 81.6%	 80.0%	 88.9%	10	8	1 4	 87.0%	 84.2%	 81.6%	 80.0%	 88.9%	10	8	1 French (10) 1	 6.6%	  9.4%	 16.0%	 30.0%	  5.6%	10	3	51 2	  7.8%	 11.4%	 21.2%	 50.0%	  6.4%	10	5	73 3	  7.4%	 10.9%	 20.5%	 50.0%	  6.1%	10	5	77 4	  7.4%	 10.9%	 20.5%	 50.0%	  6.1%	10	5	77 Tagalog (9) 1	 23.1%	 32.0%	 51.9%	 88.9%	 19.5%	9	8	33 2	 17.2%	 24.6%	 43.5%	 88.9%	 14.3%	9	8	48 3	 16.3%	 23.5%	 42.1%	 88.9%	 13.6%	9	8	51 4	 16.3%	 23.5%	 42.1%	 88.9%	 13.6%	9	8	51 German (8) 1	 25.0%	 34.1%	 53.8%	 87.5%	 21.2%	8	7	26 2	 18.6%	 26.4%	 45.5%	 87.5%	 15.6%	8	7	38 3	 17.9%	 25.5%	 44.3%	 87.5%	 14.9%	8	7	40 4	 17.9%	 25.5%	 44.3%	 87.5%	 14.9%	8	7	40 Malay (6) 1	 0.0%	  0.0%	  0.0%	  0.0%	  0.0%	6	0	0 2	  0.0%	  0.0%	  0.0%	  0.0%	  0.0%	6	0	0 3	  0.0%	  0.0%	  0.0%	  0.0%	  0.0%	6	0	0 4	  0.0%	  0.0%	  0.0%	  0.0%	  0.0%	6	0	0 Russian (5) 1	 88.2%	 75.0%	 65.2%	 60.0%	100.0%	5	3	0 2	 88.2%	 75.0%	 65.2%	 60.0%	100.0%	5	3	0 3	 88.2%	 75.0%	 65.2%	 60.0%	100.0%	5	3	0 4	 88.2%	 75.0%	 65.2%	 60.0%	100.0%	5	3	0 Turkish (5) 1	 51.7%	 54.5%	 57.7%	 60.0%	 50.0%	5	3	3 2	 40.5%	 46.2%	 53.6%	 60.0%	 37.5%	5	3	5 3	 40.5%	 46.2%	 53.6%	 60.0%	 37.5%	5	3	5 4	 40.5%	 46.2%	 53.6%	 60.0%	 37.5%	5	3	5 Indonesian (4) 1	 20.0%	 28.6%	 50.0%	100.0%	 16.7%	4	4	20 2	 15.2%	 22.2%	 41.7%	100.0%	 12.5%	4	4	28 3	 13.9%	 20.5%	 39.2%	100.0%	 11.4%	4	4	31 4	 13.9%	 20.5%	 39.2%	100.0%	 11.4%	4	4	31 Persian (4) 1	 75.0%	 75.0%	 75.0%	 75.0%	 75.0%	4	3	1 2	 83.3%	 88.9%	 95.2%	100.0%	 80.0%	4	4	1 3	 83.3%	 88.9%	 95.2%	100.0%	 80.0%	4	4	1 4	 83.3%	 88.9%	 95.2%	100.0%	 80.0%	4	4	1 Swahili (4) 1	 0.0%	  0.0%	  0.0%	  0.0%	  0.0%	4	0	0 2	  0.0%	  0.0%	  0.0%	  0.0%	  0.0%	4	0	0 3	  0.0%	  0.0%	  0.0%	  0.0%	  0.0%	4	0	0 4	  0.0%	  0.0%	  0.0%	  0.0%	  0.0%	4	0	0 Korean (3) 1	100.0%	100.0%	100.0%	100.0%	100.0%	3	3	0 2	100.0%	100.0%	100.0%	100.0%	100.0%	3	3	0 3	100.0%	100.0%	100.0%	100.0%	100.0%	3	3	0 4	100.0%	100.0%	100.0%	100.0%	100.0%	3	3	0 Bengali (2) 1	100.0%	100.0%	100.0%	100.0%	100.0%	2	2	0 2	100.0%	100.0%	100.0%	100.0%	100.0%	2	2	0 3	100.0%	100.0%	100.0%	100.0%	100.0%	2	2	0 4	100.0%	100.0%	100.0%	100.0%	100.0%	2	2	0 Bulgarian (2) 1	 50.0%	 50.0%	 50.0%	 50.0%	 50.0%	2	1	1 2	 50.0%	 50.0%	 50.0%	 50.0%	 50.0%	2	1	1 3	 50.0%	 50.0%	 50.0%	 50.0%	 50.0%	2	1	1 4	 50.0%	 50.0%	 50.0%	 50.0%	 50.0%	2	1	1 Hindi (2) 1	100.0%	100.0%	100.0%	100.0%	100.0%	2	2	0 2	100.0%	100.0%	100.0%	100.0%	100.0%	2	2	0 3	100.0%	100.0%	100.0%	100.0%	100.0%	2	2	0 4	100.0%	100.0%	100.0%	100.0%	100.0%	2	2	0 Italian (2) 1	 4.5%	  7.0%	 15.9%	100.0%	  3.6%	2	2	53 2	  3.0%	  4.7%	 10.9%	100.0%	  2.4%	2	2	82 3	  2.7%	  4.2%	  9.9%	100.0%	  2.2%	2	2	91 4	  2.7%	  4.2%	  9.9%	100.0%	  2.2%	2	2	91 Norwegian (2) 1	 8.6%	 12.5%	 22.7%	 50.0%	  7.1%	2	1	13 2	  6.8%	 10.0%	 19.2%	 50.0%	  5.6%	2	1	17 3	  6.8%	 10.0%	 19.2%	 50.0%	  5.6%	2	1	17 4	  6.8%	 10.0%	 19.2%	 50.0%	  5.6%	2	1	17 Croatian (1) 1	 0.0%	  0.0%	  0.0%	  0.0%	  0.0%	1	0	3 2	  0.0%	  0.0%	  0.0%	  0.0%	  0.0%	1	0	10 3	  0.0%	  0.0%	  0.0%	  0.0%	  0.0%	1	0	11 4	  0.0%	  0.0%	  0.0%	  0.0%	  0.0%	1	0	12 Dutch (1) 1	 0.0%	  0.0%	  0.0%	  0.0%	  0.0%	1	0	14 2	  0.0%	  0.0%	  0.0%	  0.0%	  0.0%	1	0	20 3	  0.0%	  0.0%	  0.0%	  0.0%	  0.0%	1	0	21 4	  0.0%	  0.0%	  0.0%	  0.0%	  0.0%	1	0	21 Estonian (1) 1	 17.2%	 25.0%	 45.5%	100.0%	 14.3%	1	1	6 2	 7.7%	 11.8%	 25.0%	100.0%	  6.2%	1	1	15 3	  6.8%	 10.5%	 22.7%	100.0%	  5.6%	1	1	17 4	  6.8%	 10.5%	 22.7%	100.0%	  5.6%	1	1	17 Finnish (1) 1	 15.2%	 22.2%	 41.7%	100.0%	 12.5%	1	1	7 2	 11.1%	 16.7%	 33.3%	100.0%	 9.1%	1	1	10 3	 11.1%	 16.7%	 33.3%	100.0%	  9.1%	1	1	10 4	 11.1%	 16.7%	 33.3%	100.0%	  9.1%	1	1	10 Greek (1) 1	100.0%	100.0%	100.0%	100.0%	100.0%	1	1	0 2	100.0%	100.0%	100.0%	100.0%	100.0%	1	1	0 3	100.0%	100.0%	100.0%	100.0%	100.0%	1	1	0 4	100.0%	100.0%	100.0%	100.0%	100.0%	1	1	0 Hmong (1) 1	 0.0%	  0.0%	  0.0%	  0.0%	  0.0%	1	0	0 2	  0.0%	  0.0%	  0.0%	  0.0%	  0.0%	1	0	0 3	  0.0%	  0.0%	  0.0%	  0.0%	  0.0%	1	0	0 4	  0.0%	  0.0%	  0.0%	  0.0%	  0.0%	1	0	0 Japanese (1) 1	100.0%	100.0%	100.0%	100.0%	100.0%	1	1	0 2	100.0%	100.0%	100.0%	100.0%	100.0%	1	1	0 3	100.0%	100.0%	100.0%	100.0%	100.0%	1	1	0 4	100.0%	100.0%	100.0%	100.0%	100.0%	1	1	0 Kannada (1) 1	 0.0%	  0.0%	  0.0%	  0.0%	  0.0%	1	0	0 2	  0.0%	  0.0%	  0.0%	  0.0%	  0.0%	1	0	0 3	  0.0%	  0.0%	  0.0%	  0.0%	  0.0%	1	0	0 4	  0.0%	  0.0%	  0.0%	  0.0%	  0.0%	1	0	0 Latin (1) 1	 0.0%	  0.0%	  0.0%	  0.0%	  0.0%	1	0	0 2	  0.0%	  0.0%	  0.0%	  0.0%	  0.0%	1	0	0 3	  0.0%	  0.0%	  0.0%	  0.0%	  0.0%	1	0	0 4	  0.0%	  0.0%	  0.0%	  0.0%	  0.0%	1	0	0 Polish (1) 1	 23.8%	 33.3%	 55.6%	100.0%	 20.0%	1	1	4 2	 20.0%	 28.6%	 50.0%	100.0%	 16.7%	1	1	5 3	 20.0%	 28.6%	 50.0%	100.0%	 16.7%	1	1	5 4	 17.2%	 25.0%	 45.5%	100.0%	 14.3%	1	1	6 Serbian (1) 1	 0.0%	  0.0%	  0.0%	  0.0%	  0.0%	1	0	0 2	  0.0%	  0.0%	  0.0%	  0.0%	  0.0%	1	0	0 3	  0.0%	  0.0%	  0.0%	  0.0%	  0.0%	1	0	0 4	  0.0%	  0.0%	  0.0%	  0.0%	  0.0%	1	0	0 Somali (1) 1	 0.0%	  0.0%	  0.0%	  0.0%	  0.0%	1	0	0 2	  0.0%	  0.0%	  0.0%	  0.0%	  0.0%	1	0	0 3	  0.0%	  0.0%	  0.0%	  0.0%	  0.0%	1	0	0 4	  0.0%	  0.0%	  0.0%	  0.0%	  0.0%	1	0	0 Swedish (1) 1	 0.0%	  0.0%	  0.0%	  0.0%	  0.0%	1	0	7 2	  0.0%	  0.0%	  0.0%	  0.0%	  0.0%	1	0	14 3	  0.0%	  0.0%	  0.0%	  0.0%	  0.0%	1	0	16 4	  0.0%	  0.0%	  0.0%	  0.0%	  0.0%	1	0	16 Tamil (1) 1	100.0%	100.0%	100.0%	100.0%	100.0%	1	1	0 2	100.0%	100.0%	100.0%	100.0%	100.0%	1	1	0 3	100.0%	100.0%	100.0%	100.0%	100.0%	1	1	0 4	100.0%	100.0%	100.0%	100.0%	100.0%	1	1	0 Thai (1) 1	100.0%	100.0%	100.0%	100.0%	100.0%	1	1	0 2	100.0%	100.0%	100.0%	100.0%	100.0%	1	1	0 3	100.0%	100.0%	100.0%	100.0%	100.0%	1	1	0 4	100.0%	100.0%	100.0%	100.0%	100.0%	1	1	0 Uzbek (1) 1	 0.0%	  0.0%	  0.0%	  0.0%	  0.0%	1	0	0 2	  0.0%	  0.0%	  0.0%	  0.0%	  0.0%	1	0	0 3	  0.0%	  0.0%	  0.0%	  0.0%	  0.0%	1	0	0 4	  0.0%	  0.0%	  0.0%	  0.0%	  0.0%	1	0	0 Albanian (0) 1	 0.0%	  0.0%	  0.0%	  0.0%	  0.0%	0	0	14 2	  0.0%	  0.0%	  0.0%	  0.0%	  0.0%	0	0	21 3	  0.0%	  0.0%	  0.0%	  0.0%	  0.0%	0	0	21 4	  0.0%	  0.0%	  0.0%	  0.0%	  0.0%	0	0	21 Czech (0) 1	 0.0%	  0.0%	  0.0%	  0.0%	  0.0%	0	0	2 2	  0.0%	  0.0%	  0.0%	  0.0%	  0.0%	0	0	6 3	  0.0%	  0.0%	  0.0%	  0.0%	  0.0%	0	0	6 4	  0.0%	  0.0%	  0.0%	  0.0%	  0.0%	0	0	6 Danish (0) 1	 0.0%	  0.0%	  0.0%	  0.0%	  0.0%	0	0	8 2	  0.0%	  0.0%	  0.0%	  0.0%	  0.0%	0	0	16 3	  0.0%	  0.0%	  0.0%	  0.0%	  0.0%	0	0	19 4	  0.0%	  0.0%	  0.0%	  0.0%	  0.0%	0	0	19 Hungarian (0) 1	 0.0%	  0.0%	  0.0%	  0.0%	  0.0%	0	0	2 2	  0.0%	  0.0%	  0.0%	  0.0%	  0.0%	0	0	2 3	  0.0%	  0.0%	  0.0%	  0.0%	  0.0%	0	0	4 4	  0.0%	  0.0%	  0.0%	  0.0%	  0.0%	0	0	4 Latvian (0) 1	 0.0%	  0.0%	  0.0%	  0.0%	  0.0%	0	0	0 2	  0.0%	  0.0%	  0.0%	  0.0%	  0.0%	0	0	1 3	  0.0%	  0.0%	  0.0%	  0.0%	  0.0%	0	0	2 4	  0.0%	  0.0%	  0.0%	  0.0%	  0.0%	0	0	3 Lithuanian (0) 1	 0.0%	  0.0%	  0.0%	  0.0%	  0.0%	0	0	6 2	  0.0%	  0.0%	  0.0%	  0.0%	  0.0%	0	0	10 3	  0.0%	  0.0%	  0.0%	  0.0%	  0.0%	0	0	11 4	  0.0%	  0.0%	  0.0%	  0.0%	  0.0%	0	0	11 Macedonian (0) 1	 0.0%	  0.0%	  0.0%	  0.0%	  0.0%	0	0	2 2	  0.0%	  0.0%	  0.0%	  0.0%	  0.0%	0	0	3 3	  0.0%	  0.0%	  0.0%	  0.0%	  0.0%	0	0	3 4	  0.0%	  0.0%	  0.0%	  0.0%	  0.0%	0	0	3 Romanian (0) 1	 0.0%	  0.0%	  0.0%	  0.0%	  0.0%	0	0	71 2	  0.0%	  0.0%	  0.0%	  0.0%	  0.0%	0	0	92 3	  0.0%	  0.0%	  0.0%	  0.0%	  0.0%	0	0	96 4	  0.0%	  0.0%	  0.0%	  0.0%	  0.0%	0	0	98 Ukrainian (0) 1	 0.0%	  0.0%	  0.0%	  0.0%	  0.0%	0	0	1 2	  0.0%	  0.0%	  0.0%	  0.0%	  0.0%	0	0	1 3	  0.0%	  0.0%	  0.0%	  0.0%	  0.0%	0	0	1 4	  0.0%	  0.0%	  0.0%	  0.0%	  0.0%	0	0	1 thresh	f0.5	f1	f2	recall	prec	total	hits	misses

Most frequent incorrect ID by language
English (599) 1	Romanian (65)	French (50)	Italian (47)	Tagalog (26)	German (22) Spanish (20)	Dutch (14)	Albanian (13)	Indonesian (13)	Norwegian (12) Portuguese (11)	Danish (8)	Finnish (7)	Swedish (7)	Lithuanian (5) Estonian (4)	Polish (4)	Croatian (3)	Czech (2)	Hungarian (2) Turkish (1) 2	Romanian (82)	Italian (74)	French (70)	Tagalog (39)	German (34) Spanish (33)	Albanian (19)	Dutch (19)	Indonesian (18)	Danish (16) Norwegian (16)	Portuguese (13)	Swedish (13)	Estonian (12)	Croatian (9) Finnish (9)	Lithuanian (9)	Czech (6)	Polish (5)	Hungarian (2) Turkish (2)	Latvian (1) 3	Romanian (85)	Italian (79)	French (74)	Tagalog (42)	Spanish (37) German (36)	Indonesian (21)	Albanian (19)	Danish (19)	Dutch (19) Norwegian (16)	Swedish (15)	Estonian (14)	Portuguese (13)	Croatian (10) Finnish (9)	Lithuanian (9)	Czech (6)	Polish (5)	Hungarian (4) Latvian (2)	Turkish (2) 4	Romanian (87)	Italian (79)	French (74)	Tagalog (42)	Spanish (37) German (36)	Indonesian (21)	Albanian (19)	Danish (19)	Dutch (19) Norwegian (16)	Swedish (15)	Estonian (14)	Portuguese (13)	Croatian (10) Finnish (9)	Lithuanian (9)	Czech (6)	Polish (6)	Hungarian (4) Latvian (3)	Turkish (2) Spanish (43) 1	Romanian (4)	Italian (3)	German (2)	Portuguese (2)	English (1) Lithuanian (1)	Tagalog (1) 2	Romanian (6)	Italian (3)	German (2)	Portuguese (2)	English (1) Estonian (1)	Finnish (1)	French (1)	Lithuanian (1)	Tagalog (1) 3	Romanian (7)	Italian (5)	German (2)	Portuguese (2)	English (1) Estonian (1)	Finnish (1)	French (1)	Lithuanian (1)	Tagalog (1) 4	Romanian (7)	Italian (5)	German (2)	Portuguese (2)	Croatian (1) English (1)	Estonian (1)	Finnish (1)	French (1)	Lithuanian (1) Tagalog (1) Chinese (20) 1 2	Dutch (1)	Tagalog (1) 3	Dutch (1)	Tagalog (1) 4	Dutch (1)	Tagalog (1) Portuguese (19) 1	Spanish (5)	Italian (1)	Romanian (1) 2	Spanish (7)	Italian (3)	Romanian (2)	French (1) 3	Spanish (8)	Italian (3)	Romanian (2)	French (1) 4	Spanish (8)	Italian (3)	Romanian (2)	French (1) Arabic (10) 1	English (1)	Persian (1) 2	Albanian (1)	English (1)	Persian (1) 3	Albanian (1)	English (1)	Persian (1) 4	Albanian (1)	English (1)	Persian (1) French (10) 1	German (2)	English (1)	Estonian (1)	Italian (1)	Romanian (1) Tagalog (1) 2	German (2)	Romanian (2)	English (1)	Estonian (1)	Italian (1) Tagalog (1) 3	German (2)	Italian (2)	Romanian (2)	English (1)	Estonian (1) Tagalog (1) 4	German (2)	Italian (2)	Romanian (2)	English (1)	Estonian (1) Tagalog (1) Tagalog (9) 1	Italian (1) 2	Italian (1) 3	Italian (1) 4	Italian (1) German (8) 1	Tagalog (1) 2	Tagalog (2)	Indonesian (1)	Swedish (1) 3	Tagalog (2)	Indonesian (1)	Lithuanian (1)	Swedish (1) 4	Tagalog (2)	Indonesian (1)	Lithuanian (1)	Swedish (1) Malay (6) 1	Indonesian (5)	Tagalog (1) 2	Indonesian (6)	Tagalog (1) 3	Indonesian (6)	Italian (1)	Tagalog (1) 4	Indonesian (6)	Italian (1)	Tagalog (1) Russian (5) 1	Macedonian (1)	Ukrainian (1) 2	Macedonian (1)	Ukrainian (1) 3	Macedonian (1)	Ukrainian (1) 4	Macedonian (1)	Ukrainian (1) Turkish (5) 1	Estonian (1)	Tagalog (1) 2	Estonian (1)	Tagalog (1) 3	Estonian (1)	Tagalog (1) 4	Estonian (1)	Tagalog (1) Persian (4) 1	Arabic (1) 2	Arabic (1) 3	Arabic (1) 4	Arabic (1) Swahili (4) 1	Indonesian (2)	Tagalog (2) 2	Indonesian (3)	Tagalog (2)	Croatian (1)	Turkish (1) 3	Indonesian (3)	Tagalog (2)	Croatian (1)	Dutch (1)	Turkish (1) 4	Indonesian (3)	Tagalog (2)	Croatian (1)	Dutch (1)	Turkish (1) Bulgarian (2) 1	Macedonian (1) 2	Macedonian (1) 3	Macedonian (1) 4	Macedonian (1) Norwegian (2) 1	Portuguese (1) 2	Portuguese (1) 3	Portuguese (1) 4	Portuguese (1) Croatian (1) 1	Spanish (1) 2	Spanish (1) 3	Spanish (1) 4	Spanish (1) Dutch (1) 1	French (1) 2	French (1) 3	French (1) 4	French (1) Hmong (1) 1	Albanian (1) 2	Albanian (1) 3	Albanian (1) 4	Albanian (1) Latin (1) 1	Portuguese (1) 2	Portuguese (1) 3	Portuguese (1) 4	Portuguese (1) Serbian (1) 1	Bulgarian (1) 2	Bulgarian (1)	Macedonian (1) 3	Bulgarian (1)	Macedonian (1) 4	Bulgarian (1)	Macedonian (1) Somali (1) 1	Turkish (1) 2	Turkish (1) 3	Turkish (1) 4	Turkish (1) Swedish (1) 1	Norwegian (1) 2	Norwegian (1) 3	Norwegian (1) 4	Norwegian (1) Uzbek (1) 1	Turkish (1) 2	Turkish (1) 3	Turkish (1) 4	Turkish (1)

Most frequent ID for non-languages
Name (361) 1	English (50)	German (36)	Tagalog (36)	Italian (33)	French (30) Indonesian (26)	Romanian (21)	Spanish (16)	Finnish (11)	Turkish (11) Norwegian (10)	Albanian (9)	Croatian (9)	Dutch (9)	Portuguese (9) Danish (8)	Estonian (7)	Swedish (7)	Hungarian (6)	Lithuanian (6) Latvian (4)	Polish (4)	Vietnamese (2)	Chinese (1) 2	English (71)	Tagalog (54)	German (48)	Indonesian (40)	Italian (40) French (35)	Romanian (32)	Spanish (21)	Croatian (18)	Portuguese (16) Finnish (15)	Turkish (15)	Danish (13)	Dutch (13)	Norwegian (13) Swedish (13)	Albanian (11)	Estonian (10)	Lithuanian (10)	Hungarian (7) Latvian (6)	Polish (6)	Vietnamese (2)	Chinese (1) 3	English (71)	Tagalog (57)	German (53)	Indonesian (46)	Italian (42) French (37)	Romanian (33)	Spanish (25)	Croatian (19)	Finnish (17) Portuguese (16)	Turkish (16)	Danish (14)	Swedish (14)	Dutch (13) Norwegian (13)	Albanian (12)	Estonian (11)	Lithuanian (10)	Hungarian (7) Polish (7)	Latvian (6)	Vietnamese (2)	Chinese (1)	Czech (1) 4	English (71)	Tagalog (57)	German (53)	Indonesian (46)	Italian (43) French (37)	Romanian (33)	Spanish (26)	Croatian (19)	Finnish (17) Portuguese (16)	Turkish (16)	Danish (15)	Swedish (15)	Albanian (13) Dutch (13)	Norwegian (13)	Estonian (11)	Lithuanian (10)	Hungarian (7) Polish (7)	Latvian (6)	Vietnamese (2)	Chinese (1)	Czech (1) ?? (69) 1	Indonesian (11)	English (10)	Tagalog (10)	Romanian (5)	Dutch (4) Albanian (3)	German (3)	Polish (3)	Portuguese (3)	Spanish (3) Croatian (2)	Estonian (2)	Italian (2)	Norwegian (2)	Bulgarian (1) Danish (1)	Finnish (1)	French (1)	Hungarian (1)	Swedish (1) 2	Tagalog (15)	English (12)	Indonesian (11)	Albanian (8)	Romanian (6) Croatian (5)	Dutch (5)	Estonian (5)	Portuguese (5)	Polish (4) Spanish (4)	German (3)	Italian (3)	Norwegian (3)	Swedish (3) Danish (2)	Finnish (2)	French (2)	Hungarian (2)	Bulgarian (1) 3	Tagalog (15)	English (12)	Indonesian (11)	Albanian (9)	Romanian (7) Croatian (5)	Dutch (5)	Estonian (5)	Portuguese (5)	Polish (4) Spanish (4)	French (3)	German (3)	Italian (3)	Norwegian (3) Swedish (3)	Danish (2)	Finnish (2)	Hungarian (2)	Bulgarian (1) 4	Tagalog (15)	English (12)	Indonesian (11)	Albanian (9)	Romanian (7) Croatian (5)	Dutch (5)	Estonian (5)	Portuguese (5)	Polish (4) Spanish (4)	French (3)	German (3)	Italian (3)	Norwegian (3) Swedish (3)	Danish (2)	Finnish (2)	Hungarian (2)	Bulgarian (1) URL (67) 1	English (29)	French (6)	Italian (6)	Tagalog (5)	Portuguese (4) Chinese (2)	Croatian (2)	Dutch (2)	German (2)	Hungarian (2) Polish (2)	Czech (1)	Estonian (1)	Norwegian (1)	Romanian (1) Spanish (1) 2	English (32)	Italian (9)	French (7)	Portuguese (7)	Tagalog (5) German (4)	Polish (4)	Croatian (3)	Romanian (3)	Chinese (2) Dutch (2)	Estonian (2)	Hungarian (2)	Czech (1)	Norwegian (1) Spanish (1)	Swedish (1)	Turkish (1) 3	English (33)	Italian (9)	French (7)	Portuguese (7)	Tagalog (5) German (4)	Polish (4)	Croatian (3)	Dutch (3)	Romanian (3) Chinese (2)	Estonian (2)	Hungarian (2)	Spanish (2)	Czech (1) Danish (1)	Norwegian (1)	Swedish (1)	Turkish (1) 4	English (33)	Italian (9)	French (7)	Portuguese (7)	Tagalog (5) German (4)	Polish (4)	Croatian (3)	Dutch (3)	Romanian (3) Chinese (2)	Estonian (2)	Hungarian (2)	Spanish (2)	Czech (1) Danish (1)	Norwegian (1)	Swedish (1)	Turkish (1) Junk (46) 1	Albanian (7)	English (5)	Hungarian (5)	Dutch (4)	French (4) Polish (3)	Portuguese (3)	Estonian (2)	Indonesian (2)	Italian (2) Norwegian (2)	Swedish (2)	Danish (1)	Finnish (1)	Latvian (1) Romanian (1)	Tagalog (1) 2	Albanian (8)	Polish (8)	English (7)	Hungarian (7)	Dutch (6) Portuguese (5)	French (4)	Norwegian (4)	Indonesian (3)	Swedish (3) Danish (2)	Estonian (2)	Finnish (2)	Italian (2)	Croatian (1) Czech (1)	German (1)	Latvian (1)	Romanian (1)	Tagalog (1) 3	Albanian (9)	Polish (9)	English (7)	Hungarian (7)	Dutch (6) Norwegian (5)	Portuguese (5)	French (4)	Indonesian (3)	Swedish (3) Danish (2)	Estonian (2)	Finnish (2)	Italian (2)	Tagalog (2) Croatian (1)	Czech (1)	German (1)	Latvian (1)	Romanian (1) 4	Albanian (9)	Polish (9)	English (7)	Hungarian (7)	Dutch (6) Norwegian (5)	Portuguese (5)	French (4)	Indonesian (3)	Swedish (3) Danish (2)	Estonian (2)	Finnish (2)	Italian (2)	Tagalog (2) Croatian (1)	Czech (1)	German (1)	Latvian (1)	Romanian (1) DOI (33) 1	French (11)	English (5)	Croatian (3)	Romanian (3)	Albanian (2) Estonian (2)	German (2)	Polish (1)	Spanish (1) 2	French (13)	English (7)	Albanian (5)	Danish (4)	Croatian (3) German (3)	Romanian (3)	Estonian (2)	Spanish (2)	Finnish (1) Indonesian (1)	Polish (1)	Vietnamese (1) 3	French (13)	English (7)	Albanian (5)	Danish (4)	Croatian (3) German (3)	Romanian (3)	Estonian (2)	Spanish (2)	Finnish (1) Indonesian (1)	Polish (1)	Vietnamese (1) 4	French (13)	English (7)	Albanian (5)	Danish (4)	Croatian (3) German (3)	Romanian (3)	Estonian (2)	Spanish (2)	Finnish (1) Indonesian (1)	Polish (1)	Vietnamese (1) User (16) 1	Croatian (2)	English (2)	Romanian (2)	Spanish (2)	Dutch (1) Estonian (1)	German (1)	Indonesian (1)	Italian (1)	Lithuanian (1) Tagalog (1)	Turkish (1) 2	Tagalog (3)	Croatian (2)	English (2)	Estonian (2)	Indonesian (2) Romanian (2)	Spanish (2)	Dutch (1)	French (1)	German (1) Italian (1)	Lithuanian (1)	Polish (1)	Turkish (1) 3	Tagalog (4)	Croatian (3)	English (2)	Estonian (2)	Indonesian (2) Polish (2)	Romanian (2)	Spanish (2)	Dutch (1)	French (1) German (1)	Italian (1)	Lithuanian (1)	Turkish (1) 4	Tagalog (4)	Croatian (3)	English (2)	Estonian (2)	Indonesian (2) Polish (2)	Romanian (2)	Spanish (2)	Dutch (1)	French (1) German (1)	Italian (1)	Lithuanian (1)	Turkish (1) Species (13) 1	English (3)	Italian (3)	Romanian (2)	Estonian (1)	French (1) Lithuanian (1)	Portuguese (1)	Tagalog (1) 2	English (4)	Romanian (4)	Italian (3)	Spanish (2)	Estonian (1) French (1)	Lithuanian (1)	Portuguese (1)	Tagalog (1) 3	English (4)	Romanian (4)	Italian (3)	Spanish (2)	Estonian (1) French (1)	Lithuanian (1)	Portuguese (1)	Tagalog (1) 4	English (4)	Romanian (4)	Italian (3)	Spanish (2)	Estonian (1) French (1)	Lithuanian (1)	Portuguese (1)	Tagalog (1) Number (12) 1	Chinese (2)	Portuguese (1) 2	Chinese (2)	Portuguese (1) 3	Chinese (2)	Portuguese (1) 4	Chinese (2)	Portuguese (1) DevTrans (11) 1	Indonesian (3)	Tagalog (3)	Albanian (1)	Finnish (1)	German (1) Latvian (1)	Portuguese (1) 2	Indonesian (5)	Tagalog (3)	Albanian (2)	Finnish (1)	German (1) Latvian (1)	Portuguese (1)	Romanian (1) 3	Indonesian (5)	Tagalog (3)	Albanian (2)	Finnish (1)	German (1) Latvian (1)	Portuguese (1)	Romanian (1) 4	Indonesian (5)	Tagalog (3)	Albanian (2)	Finnish (1)	German (1) Latvian (1)	Portuguese (1)	Romanian (1) None (10) 1	English (4)	Albanian (1)	Danish (1)	French (1)	Portuguese (1) Romanian (1) 2	English (4)	Albanian (1)	Danish (1)	French (1)	German (1) Portuguese (1)	Romanian (1) 3	English (4)	Albanian (1)	Danish (1)	French (1)	German (1) Portuguese (1)	Romanian (1) 4	English (4)	Albanian (1)	Danish (1)	French (1)	German (1) Portuguese (1)	Romanian (1) OCR (10) 1	Dutch (2)	Romanian (2)	Albanian (1)	Estonian (1)	German (1) Italian (1)	Swedish (1)	Turkish (1) 2	Romanian (3)	Albanian (2)	Dutch (2)	Estonian (2)	German (1) Indonesian (1)	Italian (1)	Spanish (1)	Swedish (1)	Turkish (1) 3	Romanian (3)	Albanian (2)	Dutch (2)	Estonian (2)	German (1) Indonesian (1)	Italian (1)	Spanish (1)	Swedish (1)	Turkish (1) 4	Romanian (3)	Albanian (2)	Dutch (2)	Estonian (2)	German (1) Indonesian (1)	Italian (1)	Spanish (1)	Swedish (1)	Turkish (1) Mixed (6) 1	English (2)	Indonesian (2)	Arabic (1)	German (1) 2	Indonesian (3)	English (2)	Albanian (1)	Arabic (1)	Chinese (1) German (1) 3	Indonesian (3)	Albanian (2)	English (2)	Arabic (1)	Chinese (1) German (1) 4	Indonesian (3)	Albanian (2)	English (2)	Arabic (1)	Chinese (1) German (1) Emoji (5) 1	Hebrew (2) 2	Hebrew (2) 3	Hebrew (2) 4	Hebrew (2) Linked (4) 1	Chinese (1)	Croatian (1)	English (1)	Tagalog (1) 2	Chinese (1)	Croatian (1)	English (1)	French (1)	Tagalog (1) 3	Chinese (1)	Croatian (1)	English (1)	French (1)	Tagalog (1) 4	Chinese (1)	Croatian (1)	English (1)	French (1)	Tagalog (1) TamilTrans (4) 1	Tagalog (2)	Indonesian (1)	Lithuanian (1) 2	Indonesian (2)	Tagalog (2)	French (1)	Lithuanian (1)	Turkish (1) 3	Tagalog (3)	Indonesian (2)	French (1)	German (1)	Lithuanian (1) Turkish (1) 4	Tagalog (3)	Indonesian (2)	French (1)	German (1)	Lithuanian (1) Turkish (1) Abbrev (2) 1	Indonesian (1)	Italian (1) 2	Indonesian (1)	Italian (1) 3	Indonesian (1)	Italian (1) 4	Indonesian (1)	Italian (1) Email (2) 1	French (1)	Romanian (1) 2	French (1)	Indonesian (1)	Portuguese (1)	Romanian (1) 3	French (1)	Indonesian (1)	Portuguese (1)	Romanian (1) 4	French (1)	Indonesian (1)	Portuguese (1)	Romanian (1) GreekTrans (1) 1	Latvian (1) 2	Latvian (1) 3	Latvian (1) 4	Latvian (1) JapanTrans (1) 1	Croatian (1) 2	Croatian (1)	Tagalog (1) 3	Croatian (1)	Tagalog (1) 4	Croatian (1)	Tagalog (1) KoreanTrans (1) 1	Indonesian (1) 2	Estonian (1)	Indonesian (1) 3	Estonian (1)	Indonesian (1) 4	Estonian (1)	Indonesian (1) Symbol (1) 1	French (1) 2	French (1) 3	French (1) 4	French (1) TamilTran (1) 1	English (1) 2	English (1)	Finnish (1) 3	English (1)	Finnish (1)	Portuguese (1) 4	English (1)	Finnish (1)	Portuguese (1)