Adrian Velicu
8dd31a28ae
Update dictionaries (possibly_offensive flag)
...
Encode possibly offensive words with their actual frequency
and the possibly_offensive flag set.
Keep zero frequency only for distracters and for
words that should never come up.
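The encoding described above can be sketched as follows. This is a minimal illustration, not the actual dictionary tooling: it assumes a simplified `word=...,f=...` key=value line for a combined word list entry, with a `possibly_offensive=true` attribute appended when the flag is set. The function name, the simplified field set, and the sample words are all hypothetical.

```python
def format_entry(word, freq, possibly_offensive=False):
    """Format one word-list entry as a simplified key=value line.

    Assumption: entries are written as ' word=<w>,f=<freq>' and the flag is
    an extra 'possibly_offensive=true' attribute. Real word lists may carry
    more fields than shown here.
    """
    if not 0 <= freq <= 255:
        raise ValueError("frequency is assumed to fit in one byte (0..255)")
    line = f" word={word},f={freq}"
    if possibly_offensive:
        line += ",possibly_offensive=true"
    return line


# A flagged word keeps its real frequency; a distracter gets frequency 0.
flagged = format_entry("exampleword", 80, possibly_offensive=True)
distracter = format_entry("exmapleword", 0)
```

The point of the sketch is that the flag and the frequency are independent: a flagged word no longer needs a zero frequency to be kept out of suggestions.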
https://paste.googleplex.com/5167060875214848
Bug: 11031090
Change-Id: Ia394b1827f292ff8d4791cc2f3e6e50b5aff4cbe
2014-10-31 14:49:24 +09:00
Adrian Velicu
487a6a6949
Update dictionaries
...
>>> dictionaries/de_wordlist.combined.gz
Header :
date : 1393228134 <=> 1412325412
version : 44 <=> 52
Body :
Probability changed: kommen 0 -> 149
Added: Käsebrötchen 50
Added: Lädst 50
Added: Müllbeutel 50
Added: Theresienwiese 50
Added: Verdammtes 50
Added: Wurstbrötchen 50
Added: abgebe 50
Added: angucke 50
Added: async 20
Added: backends 20
Added: brate 50
Added: erschreckendes 50
Added: erwische 50
Added: fahrt 80
Added: fragst 100
Added: gepostet 50
Added: gewundert 80
Added: gucke 50
Added: hattet 50
Added: hinkriege 50
Added: hustet 50
Added: hättet 60
Added: irgendwer 60
Added: koche 50
Added: kriege 70
Added: lehrst 50
Added: motivierenden 50
Added: müsstest 50
Added: müsstet 50
Added: organisiere 50
Added: peilen 50
Added: probiere 50
Added: rede 50
Added: reserviere 50
Added: sag 120
Added: schickes 80
Added: schickst 90
Added: sitze 50
Added: standet 50
Added: stolpere 50
Added: stressig 50
Added: telefoniere 80
Added: wolltest 100
Added: wolltet 100
Added: würdet 100
Added: ziele 50
Added: ähnlich 50
Added: älteren 50
Added: übelriechend 80
Added: überholen 50
Added: überlege 50
Added: überlegen 50
Added: überlegt 50
Added: übermorgen 50
Added: übernachte 50
Added: überquert 50
Added: überstanden 50
Added: übrig 50
Added: übrigens 50
>>> dictionaries/en_GB_wordlist.combined.gz
Header :
date : 1402373154 <=> 1412325408
version : 47 <=> 52
Body :
Deleted: Pinterest 25
Added: Edamame 25
Added: Pinterest 25
Added: amd 0
>>> dictionaries/en_US_wordlist.combined.gz
Header :
date : 1402373154 <=> 1412325184
version : 47 <=> 52
Body :
Deleted: Pinterest 25
Added: Edamame 25
Added: Pinterest 25
Added: amd 0
>>> dictionaries/en_wordlist.combined.gz
Header :
date : 1402373178 <=> 1412325419
version : 47 <=> 52
Body :
Deleted: Pinterest 25
Added: Edamame 25
Added: Pinterest 25
Added: amd 0
>>> dictionaries/es_wordlist.combined.gz
Header :
date : 1404131686 <=> 1412325412
version : 49 <=> 52
Body :
Added: cállese 30
Added: mándame 30
Added: recupérate 35
>>> dictionaries/ro_wordlist.combined.gz
Header :
description : Româna <=> Română
date : 1408019089 <=> 1412325511
version : 50 <=> 52
Body :
!!!!!! Truncated. !!!!!!!
>>> dictionaries/ru_wordlist.combined.gz
Header :
date : 1406597821 <=> 1412325424
version : 50 <=> 52
Body :
Deleted: Агг 52
Deleted: ЗАГС 77
Deleted: КОНКАКАФ 19
Deleted: Монк 69
Probability changed: НКАО 13 -> 0
Probability changed: НКВД 46 -> 0
Probability changed: НКО 14 -> 0
Probability changed: НКР 22 -> 0
Deleted: НОМОС-БАНК 58
Deleted: ПДД 77
Probability changed: РНК 33 -> 0
Deleted: СМС 78
Probability changed: СНК 35 -> 0
Deleted: ТОО 14
Probability changed: ТЦ 85 -> 5
Probability changed: УНКВД 11 -> 0
Deleted: ФИО 65
Deleted: Эбля 49
Probability changed: асексуальность 59 -> 0
Probability changed: бисексуал 72 -> 0
Probability changed: бисексуалов 85 -> 0
Probability changed: бисексуальной 67 -> 0
Probability changed: бисексуальности 75 -> 0
Deleted: бумажке 94
Deleted: бумажку 104
Deleted: важней 86
Deleted: вероника 58
Deleted: вероники 54
Deleted: вероникой 29
Deleted: веронику 29
Deleted: влезет 94
Deleted: влезть 87
Deleted: врожденная 75
Deleted: врожденного 78
Deleted: врожденное 71
Deleted: врожденной 85
Deleted: врожденную 66
Deleted: врожденные 82
Deleted: врожденный 82
Deleted: врожденным 79
Deleted: врожденными 76
Deleted: врожденных 86
Probability changed: врождённая 68 -> 75
Probability changed: врождённое 69 -> 71
Probability changed: врождённой 80 -> 85
Probability changed: врождённые 78 -> 82
Probability changed: врождённый 77 -> 82
Probability changed: врождённым 74 -> 79
Probability changed: врождённых 80 -> 86
Probability changed: все-таки 113 -> 30
Deleted: вылезли 88
Deleted: г-же 65
Deleted: г-н 88
Deleted: г-на 88
Probability changed: га 135 -> 0
Probability changed: гг 160 -> 0
Probability changed: гетеросексуалов 73 -> 0
Probability changed: гетеросексуального 67 -> 0
Probability changed: гетеросексуальной 71 -> 0
Probability changed: гетеросексуальности 65 -> 0
Probability changed: гетеросексуальность 67 -> 0
Probability changed: гетеросексуальную 65 -> 0
Probability changed: гетеросексуальные 76 -> 0
Probability changed: гетеросексуальных 77 -> 0
Probability changed: гомосексуал 74 -> 0
Probability changed: гомосексуала 67 -> 0
Probability changed: гомосексуалам 75 -> 0
Probability changed: гомосексуалами 70 -> 0
Probability changed: гомосексуализм 91 -> 0
Probability changed: гомосексуализма 91 -> 0
Probability changed: гомосексуализме 74 -> 0
Probability changed: гомосексуализму 68 -> 0
Probability changed: гомосексуалист 80 -> 0
Probability changed: гомосексуалиста 72 -> 0
Probability changed: гомосексуалистам 69 -> 0
Probability changed: гомосексуалистами 69 -> 0
Probability changed: гомосексуалистов 94 -> 0
Probability changed: гомосексуалистом 78 -> 0
Probability changed: гомосексуалисты 77 -> 0
Probability changed: гомосексуалов 93 -> 0
Probability changed: гомосексуалом 65 -> 0
Probability changed: гомосексуалы 82 -> 0
Probability changed: гомосексуальная 70 -> 0
Probability changed: гомосексуального 78 -> 0
Probability changed: гомосексуальное 71 -> 0
Probability changed: гомосексуальной 93 -> 0
Probability changed: гомосексуальности 103 -> 0
Probability changed: гомосексуальность 100 -> 0
Probability changed: гомосексуальностью 73 -> 0
Probability changed: гомосексуальную 75 -> 0
Probability changed: гомосексуальные 92 -> 0
Probability changed: гомосексуальный 75 -> 0
Probability changed: гомосексуальным 74 -> 0
Probability changed: гомосексуальными 70 -> 0
Probability changed: гомосексуальных 91 -> 0
Probability changed: д-р 93 -> 0
Deleted: дада 72
Deleted: даша 55
Deleted: даши 47
Deleted: дашу 29
Probability changed: де 154 -> 30
Probability changed: др 156 -> 0
Deleted: зажги 92
Deleted: зажгу 89
Deleted: зажигай 95
Deleted: зажигаю 88
Probability changed: зоосексуальность 65 -> 0
Probability changed: иРНК 68 -> 0
Probability changed: кДНК 62 -> 0
Probability changed: кв 133 -> 0
Deleted: кио 49
Deleted: лег 91
Deleted: лезу 88
Deleted: лезь 91
Probability changed: ля 103 -> 30
Probability changed: мРНК 102 -> 0
Deleted: машка 29
Probability changed: микроРНК 65 -> 0
Deleted: мону 29
Probability changed: мтДНК 79 -> 0
Probability changed: мяРНК 65 -> 0
Deleted: нажрался 97
Deleted: налил 97
Deleted: налили 86
Probability changed: негетеросексуальной 73 -> 0
Probability changed: негетеросексуальный 73 -> 0
Deleted: орут 98
Deleted: отт 64
Deleted: паша 83
Deleted: паше 66
Deleted: пашей 69
Deleted: пашой 73
Deleted: подоконник 88
Deleted: подскажет 87
Deleted: подскажете 89
Deleted: подскажите 112
Deleted: покажите 95
Deleted: полезли 91
Probability changed: пр 129 -> 0
Probability changed: пре-мРНК 78 -> 0
Deleted: пресекся 73
Probability changed: рРНК 91 -> 0
Deleted: раздражённо 91
Deleted: сажусь 99
Deleted: саше 54
Probability changed: секс 106 -> 0
Probability changed: секс-символ 74 -> 0
Probability changed: секс-символов 65 -> 0
Probability changed: секс-символом 74 -> 0
Probability changed: секс-туризм 62 -> 0
Probability changed: секса 105 -> 0
Probability changed: сексе 93 -> 0
Deleted: секси 88
Probability changed: сексизм 63 -> 0
Probability changed: сексизма 72 -> 0
Probability changed: сексолог 75 -> 0
Probability changed: сексологии 80 -> 0
Probability changed: сексом 102 -> 0
Probability changed: сексу 80 -> 0
Probability changed: сексуальная 95 -> 0
Probability changed: сексуально 88 -> 0
Probability changed: сексуального 107 -> 0
Probability changed: сексуальное 98 -> 0
Probability changed: сексуальной 111 -> 0
Probability changed: сексуальном 84 -> 0
Probability changed: сексуальному 79 -> 0
Probability changed: сексуальности 99 -> 0
Probability changed: сексуальность 90 -> 0
Probability changed: сексуальностью 70 -> 0
Probability changed: сексуальную 95 -> 0
Probability changed: сексуальные 105 -> 0
Probability changed: сексуальный 91 -> 0
Probability changed: сексуальным 95 -> 0
Probability changed: сексуальными 84 -> 0
Probability changed: сексуальных 113 -> 0
Deleted: сете 78
Deleted: слезой 87
Deleted: соображаю 90
Probability changed: тРНК 86 -> 0
Deleted: тав 69
Probability changed: транссексуал 67 -> 0
Probability changed: транссексуалки 64 -> 0
Probability changed: транссексуалов 82 -> 0
Probability changed: транссексуалы 71 -> 0
Probability changed: транссексуальности 77 -> 0
Probability changed: транссексуальность 65 -> 0
Deleted: укажите 83
Probability changed: ул 137 -> 0
Deleted: устар 93
Deleted: эдак 99
Added: Вероника 58
Added: Вероники 54
Added: Вероникой 29
Added: Веронику 29
Added: Даша 55
Added: Даши 47
Added: Дашу 29
Added: Маш 57
Added: Машка 29
Added: Паша 83
Added: Паше 66
Added: Пашей 69
Added: Пашой 73
Added: Саше 54
Added: впросак 0
Added: врождённую 66
Added: втечение 0
Added: втечении 0
Added: лёг 97
Added: машу 80
Added: чтоли 0
Added: чтоль 0
Added: ща 0
Added: щас 0
>>> java/res/raw/main_de.dict
Header :
date : 1393228134 <=> 1412325412
version : 44 <=> 52
Body :
Probability changed: kommen 0 -> 149
Added: Käsebrötchen 50
Added: Lädst 50
Added: Müllbeutel 50
Added: Theresienwiese 50
Added: Verdammtes 50
Added: Wurstbrötchen 50
Added: abgebe 50
Added: angucke 50
Added: async 20
Added: backends 20
Added: brate 50
Added: erschreckendes 50
Added: erwische 50
Added: fahrt 80
Added: fragst 100
Added: gepostet 50
Added: gewundert 80
Added: gucke 50
Added: hattet 50
Added: hinkriege 50
Added: hustet 50
Added: hättet 60
Added: irgendwer 60
Added: koche 50
Added: kriege 70
Added: lehrst 50
Added: motivierenden 50
Added: müsstest 50
Added: müsstet 50
Added: organisiere 50
Added: peilen 50
Added: probiere 50
Added: rede 50
Added: reserviere 50
Added: sag 120
Added: schickes 80
Added: schickst 90
Added: sitze 50
Added: standet 50
Added: stolpere 50
Added: stressig 50
Added: telefoniere 80
Added: wolltest 100
Added: wolltet 100
Added: würdet 100
Added: ziele 50
Added: ähnlich 50
Added: älteren 50
Added: übelriechend 80
Added: überholen 50
Added: überlege 50
Added: überlegen 50
Added: überlegt 50
Added: übermorgen 50
Added: übernachte 50
Added: überquert 50
Added: überstanden 50
Added: übrig 50
Added: übrigens 50
>>> java/res/raw/main_en.dict
Header :
date : 1402373178 <=> 1412325419
version : 47 <=> 52
Body :
Deleted: Pinterest 25
Added: Edamame 25
Added: Pinterest 25
Added: amd 0
>>> java/res/raw/main_es.dict
Header :
date : 1404131686 <=> 1412325412
version : 49 <=> 52
Body :
Added: cállese 30
Added: mándame 30
Added: recupérate 35
>>> java/res/raw/main_ru.dict
Header :
date : 1406597821 <=> 1412325424
version : 50 <=> 52
Body :
Deleted: Агг 52
Deleted: ЗАГС 77
Deleted: КОНКАКАФ 19
Deleted: Монк 69
Probability changed: НКАО 13 -> 0
Probability changed: НКВД 46 -> 0
Probability changed: НКО 14 -> 0
Probability changed: НКР 22 -> 0
Deleted: НОМОС-БАНК 58
Deleted: ПДД 77
Probability changed: РНК 33 -> 0
Deleted: СМС 78
Probability changed: СНК 35 -> 0
Deleted: ТОО 14
Probability changed: ТЦ 85 -> 5
Probability changed: УНКВД 11 -> 0
Deleted: ФИО 65
Deleted: Эбля 49
Probability changed: асексуальность 59 -> 0
Probability changed: бисексуал 72 -> 0
Probability changed: бисексуалов 85 -> 0
Probability changed: бисексуальной 67 -> 0
Probability changed: бисексуальности 75 -> 0
Deleted: бумажке 94
Deleted: бумажку 104
Deleted: важней 86
Deleted: вероника 58
Deleted: вероники 54
Deleted: вероникой 29
Deleted: веронику 29
Deleted: влезет 94
Deleted: влезть 87
Deleted: врожденная 75
Deleted: врожденного 78
Deleted: врожденное 71
Deleted: врожденной 85
Deleted: врожденную 66
Deleted: врожденные 82
Deleted: врожденный 82
Deleted: врожденным 79
Deleted: врожденными 76
Deleted: врожденных 86
Probability changed: врождённая 68 -> 75
Probability changed: врождённое 69 -> 71
Probability changed: врождённой 80 -> 85
Probability changed: врождённые 78 -> 82
Probability changed: врождённый 77 -> 82
Probability changed: врождённым 74 -> 79
Probability changed: врождённых 80 -> 86
Probability changed: все-таки 113 -> 30
Deleted: вылезли 88
Deleted: г-же 65
Deleted: г-н 88
Deleted: г-на 88
Probability changed: га 135 -> 0
Probability changed: гг 160 -> 0
Probability changed: гетеросексуалов 73 -> 0
Probability changed: гетеросексуального 67 -> 0
Probability changed: гетеросексуальной 71 -> 0
Probability changed: гетеросексуальности 65 -> 0
Probability changed: гетеросексуальность 67 -> 0
Probability changed: гетеросексуальную 65 -> 0
Probability changed: гетеросексуальные 76 -> 0
Probability changed: гетеросексуальных 77 -> 0
Probability changed: гомосексуал 74 -> 0
Probability changed: гомосексуала 67 -> 0
Probability changed: гомосексуалам 75 -> 0
Probability changed: гомосексуалами 70 -> 0
Probability changed: гомосексуализм 91 -> 0
Probability changed: гомосексуализма 91 -> 0
Probability changed: гомосексуализме 74 -> 0
Probability changed: гомосексуализму 68 -> 0
Probability changed: гомосексуалист 80 -> 0
Probability changed: гомосексуалиста 72 -> 0
Probability changed: гомосексуалистам 69 -> 0
Probability changed: гомосексуалистами 69 -> 0
Probability changed: гомосексуалистов 94 -> 0
Probability changed: гомосексуалистом 78 -> 0
Probability changed: гомосексуалисты 77 -> 0
Probability changed: гомосексуалов 93 -> 0
Probability changed: гомосексуалом 65 -> 0
Probability changed: гомосексуалы 82 -> 0
Probability changed: гомосексуальная 70 -> 0
Probability changed: гомосексуального 78 -> 0
Probability changed: гомосексуальное 71 -> 0
Probability changed: гомосексуальной 93 -> 0
Probability changed: гомосексуальности 103 -> 0
Probability changed: гомосексуальность 100 -> 0
Probability changed: гомосексуальностью 73 -> 0
Probability changed: гомосексуальную 75 -> 0
Probability changed: гомосексуальные 92 -> 0
Probability changed: гомосексуальный 75 -> 0
Probability changed: гомосексуальным 74 -> 0
Probability changed: гомосексуальными 70 -> 0
Probability changed: гомосексуальных 91 -> 0
Probability changed: д-р 93 -> 0
Deleted: дада 72
Deleted: даша 55
Deleted: даши 47
Deleted: дашу 29
Probability changed: де 154 -> 30
Probability changed: др 156 -> 0
Deleted: зажги 92
Deleted: зажгу 89
Deleted: зажигай 95
Deleted: зажигаю 88
Probability changed: зоосексуальность 65 -> 0
Probability changed: иРНК 68 -> 0
Probability changed: кДНК 62 -> 0
Probability changed: кв 133 -> 0
Deleted: кио 49
Deleted: лег 91
Deleted: лезу 88
Deleted: лезь 91
Probability changed: ля 103 -> 30
Probability changed: мРНК 102 -> 0
Deleted: машка 29
Probability changed: микроРНК 65 -> 0
Deleted: мону 29
Probability changed: мтДНК 79 -> 0
Probability changed: мяРНК 65 -> 0
Deleted: нажрался 97
Deleted: налил 97
Deleted: налили 86
Probability changed: негетеросексуальной 73 -> 0
Probability changed: негетеросексуальный 73 -> 0
Deleted: орут 98
Deleted: отт 64
Deleted: паша 83
Deleted: паше 66
Deleted: пашей 69
Deleted: пашой 73
Deleted: подоконник 88
Deleted: подскажет 87
Deleted: подскажете 89
Deleted: подскажите 112
Deleted: покажите 95
Deleted: полезли 91
Probability changed: пр 129 -> 0
Probability changed: пре-мРНК 78 -> 0
Deleted: пресекся 73
Probability changed: рРНК 91 -> 0
Deleted: раздражённо 91
Deleted: сажусь 99
Deleted: саше 54
Probability changed: секс 106 -> 0
Probability changed: секс-символ 74 -> 0
Probability changed: секс-символов 65 -> 0
Probability changed: секс-символом 74 -> 0
Probability changed: секс-туризм 62 -> 0
Probability changed: секса 105 -> 0
Probability changed: сексе 93 -> 0
Deleted: секси 88
Probability changed: сексизм 63 -> 0
Probability changed: сексизма 72 -> 0
Probability changed: сексолог 75 -> 0
Probability changed: сексологии 80 -> 0
Probability changed: сексом 102 -> 0
Probability changed: сексу 80 -> 0
Probability changed: сексуальная 95 -> 0
Probability changed: сексуально 88 -> 0
Probability changed: сексуального 107 -> 0
Probability changed: сексуальное 98 -> 0
Probability changed: сексуальной 111 -> 0
Probability changed: сексуальном 84 -> 0
Probability changed: сексуальному 79 -> 0
Probability changed: сексуальности 99 -> 0
Probability changed: сексуальность 90 -> 0
Probability changed: сексуальностью 70 -> 0
Probability changed: сексуальную 95 -> 0
Probability changed: сексуальные 105 -> 0
Probability changed: сексуальный 91 -> 0
Probability changed: сексуальным 95 -> 0
Probability changed: сексуальными 84 -> 0
Probability changed: сексуальных 113 -> 0
Deleted: сете 78
Deleted: слезой 87
Deleted: соображаю 90
Probability changed: тРНК 86 -> 0
Deleted: тав 69
Probability changed: транссексуал 67 -> 0
Probability changed: транссексуалки 64 -> 0
Probability changed: транссексуалов 82 -> 0
Probability changed: транссексуалы 71 -> 0
Probability changed: транссексуальности 77 -> 0
Probability changed: транссексуальность 65 -> 0
Deleted: укажите 83
Probability changed: ул 137 -> 0
Deleted: устар 93
Deleted: эдак 99
Added: Вероника 58
Added: Вероники 54
Added: Вероникой 29
Added: Веронику 29
Added: Даша 55
Added: Даши 47
Added: Дашу 29
Added: Маш 57
Added: Машка 29
Added: Паша 83
Added: Паше 66
Added: Пашей 69
Added: Пашой 73
Added: Саше 54
Added: впросак 0
Added: врождённую 66
Added: втечение 0
Added: втечении 0
Added: лёг 97
Added: машу 80
Added: чтоли 0
Added: чтоль 0
Added: ща 0
Added: щас 0
Change-Id: I0c6bf1a1ecc9edf03523bfb080774738aa40d163
2014-10-06 10:13:37 +09:00
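The per-file reports in these entries follow a regular layout: a `>>> path` line, a `Header :` block of `key : old <=> new` pairs, and a `Body :` block of `Added:`, `Deleted:` and `Probability changed:` (older entries: `Freq changed:`) lines. A minimal parser sketch for that layout, written against the lines visible in this log only; the function names are hypothetical, and reading the `date` values as Unix timestamps is an assumption.

```python
import re
from datetime import datetime, timezone

CHANGED = re.compile(r"^(?:Probability|Freq) changed: (.+) (\d+) -> (\d+)$")
ADDED_OR_DELETED = re.compile(r"^(Added|Deleted): (.+) (\d+)$")
HEADER = re.compile(r"^(\w+) : (.+) <=> (.+)$")


def parse_report(text):
    """Return {path: {"header", "added", "deleted", "changed"}} for a report."""
    files = {}
    current = None
    for raw in text.splitlines():
        line = raw.strip()
        if line.startswith(">>> "):
            current = {"header": {}, "added": {}, "deleted": {}, "changed": {}}
            files[line[4:]] = current
        elif current is None or not line or line in ("Header :", "Body :"):
            continue
        elif m := CHANGED.match(line):
            current["changed"][m.group(1)] = (int(m.group(2)), int(m.group(3)))
        elif m := ADDED_OR_DELETED.match(line):
            current[m.group(1).lower()][m.group(2)] = int(m.group(3))
        elif m := HEADER.match(line):
            current["header"][m.group(1)] = (m.group(2), m.group(3))
    return files


def to_utc(timestamp):
    """Interpret a header date as seconds since the Unix epoch (assumption)."""
    return datetime.fromtimestamp(int(timestamp), tz=timezone.utc)


SAMPLE = """\
>>> dictionaries/de_wordlist.combined.gz
Header :
date : 1393228134 <=> 1412325412
version : 44 <=> 52
Body :
Probability changed: kommen 0 -> 149
Added: Käsebrötchen 50
Added: async 20
"""

report = parse_report(SAMPLE)
de = report["dictionaries/de_wordlist.combined.gz"]
```

Lines that match none of the patterns, such as a truncation marker or `Same changes`, are skipped rather than guessed at.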
Jean Chalard
bb0d93c4b0
Update dictionaries
...
>>> dictionaries/es_wordlist.combined.gz
Header :
date : 1403847862 <=> 1404131686
version : 48 <=> 49
Body :
Added: apurate 50
Added: bondi 50
Added: chamuyar 50
Added: conocela 50
Added: conocelo 50
Added: conoceme 50
Added: conocenos 50
Added: conocete 50
Added: copate 50
Added: creele 50
Added: creeme 50
Added: creenos 50
Added: creete 50
Added: creiste 50
Added: creés 50
Added: dale 50
Added: dame 50
Added: danos 50
Added: decile 50
Added: decime 50
Added: decinos 50
Added: estate 50
Added: hablale 50
Added: hablales 50
Added: hablame 50
Added: hablanos 50
Added: hablate 50
Added: hablá 50
Added: hacele 50
Added: haceme 50
Added: hacenos 50
Added: hacete 50
Added: hacés 50
Added: llegás 50
Added: llevale 50
Added: llevame 50
Added: llevanos 50
Added: llevate 50
Added: llevá 50
Added: llevás 50
Added: parecé 50
Added: parecés 50
Added: pasala 50
Added: pasale 50
Added: pasales 50
Added: pasalo 50
Added: pasame 50
Added: pasanos 50
Added: pasate 50
Added: pasás 50
Added: podés 50
Added: ponele 50
Added: poneme 50
Added: ponenos 50
Added: ponete 50
Added: quedá 50
Added: querela 50
Added: querelo 50
Added: quereme 50
Added: querenos 50
Added: querete 50
Added: querés 50
Added: rascate 50
Added: sabelo 50
Added: sabés 50
Added: tenele 50
Added: teneme 50
Added: tenenos 50
Added: tenete 50
Added: tenés 50
>>> java/res/raw/main_es.dict
Header :
date : 1403847862 <=> 1404131686
version : 48 <=> 49
Body :
Same changes
Bug: 8010862
Change-Id: I98fc8542e21e35a7c80b332148c461144425e61a
2014-07-01 18:19:30 +09:00
Jean Chalard
a70b710c9d
Update the Spanish dictionary
...
>>> dictionaries/es_wordlist.combined.gz
Header :
date : 1403153360 <=> 1403847862
version : 47 <=> 48
Body :
Added: bañate 30
Added: correte 30
Added: duchate 30
Added: mostrame 40
Added: muestrame 40
Added: prestame 40
Added: sos 100
>>> java/res/raw/main_es.dict
Header :
date : 1403153360 <=> 1403847862
version : 47 <=> 48
Body :
Added: bañate 30
Added: correte 30
Added: duchate 30
Added: mostrame 40
Added: muestrame 40
Added: prestame 40
Added: sos 100
Bug: 8010862
Change-Id: I0a478b5fd5edfadea420f306dc9b2d98876c246e
2014-06-27 14:56:29 +09:00
Jean Chalard
75bc45cb12
Update dictionaries
...
>>> dictionaries/es_wordlist.combined.gz
Header :
date : 1401802362 <=> 1403153360
version : 45 <=> 47
Body :
Added: grandísimo 30
>>> java/res/raw/main_es.dict
Header :
date : 1401802362 <=> 1403153360
version : 45 <=> 47
Body :
Added: grandísimo 30
Bug: 15719556
Change-Id: Ifaa97d40d52a278e41f4dd1292781494d4eb939b
2014-06-23 16:56:00 +09:00
Jean Chalard
ff3e488e1e
Enrich the Spanish dictionary.
...
Enrich the dictionary with many words generated from stems
extracted from the dictionary and rules written by hand.
This adds 45,619 words to the dictionary. Hopefully almost none
of them are incorrect, though many are not very common.
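As an illustration of the stem-plus-rules approach, here is a minimal sketch. The rule shown (Rioplatense "vos" imperatives with an optional clitic pronoun) is hypothetical and was chosen only because forms of that shape appear in a later Spanish update in this log. It is not the actual rule set or script used for this change.

```python
# Hypothetical hand-written rule, not taken from the real generation scripts.
ACCENTED = {"a": "á", "e": "é", "i": "í"}
CLITICS = ["me", "te", "le", "nos"]


def split_infinitive(infinitive):
    """Split an infinitive into stem and theme vowel: 'hablar' -> ('habl', 'a')."""
    if len(infinitive) < 3 or infinitive[-2:] not in ("ar", "er", "ir"):
        raise ValueError(f"not a regular infinitive: {infinitive!r}")
    return infinitive[:-2], infinitive[-2]


def vos_imperatives(infinitive):
    """Generate the bare vos imperative plus its clitic-attached forms."""
    stem, vowel = split_infinitive(infinitive)
    forms = [stem + ACCENTED[vowel]]  # bare form carries the accent: hablá
    forms += [stem + vowel + clitic for clitic in CLITICS]  # hablame, hablate, ...
    return forms


def expand(infinitives, known_words):
    """Apply the rule to every stem and keep only words not already known."""
    generated = []
    for infinitive in infinitives:
        for form in vos_imperatives(infinitive):
            if form not in known_words and form not in generated:
                generated.append(form)
    return generated


new_words = expand(["hablar", "llevar"], known_words={"hablame"})
```

A real pipeline would need many such rules and a review step, since purely mechanical expansion also produces forms that are valid but rare.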
Bug: 8010862
Change-Id: I51c7ebd16ff859ec1e765b0604dd1cfca159ab08
2014-06-03 22:48:19 +09:00
Jean Chalard
004cec01a9
Update all dicts to version 44.
...
Bug: 13164302
Change-Id: I8dc1a839c7dcfaa08a53e26cb6600e9f871447ce
2014-02-24 21:27:25 +09:00
Jean Chalard
a267ebed5a
Update dictionaries
...
Add KitKat to all dictionaries.
Version
da, fi, pl : 29 → 40
cs, de, hr, it, lt, lv, nb, nl, sl, sr, sv, tr : 35 → 40
es : 36 → 40
en_gb, en_us, en, fr, pt_br, pt_pt : 39 → 40
Bug: 10958192
Change-Id: I14436616285ced5eb3b70b8c44b9243da94eed4f
2013-09-30 07:12:03 +00:00
Jean Chalard
665e4ecc62
Update dictionaries
...
>>> dictionaries/en_GB_wordlist.combined.gz
Header :
date : 1374634548 <=> 1374721653
Body :
Added: Caltrain 30
>>> dictionaries/en_US_wordlist.combined.gz
Header :
date : 1374634548 <=> 1374721654
Body :
Added: Caltrain 30
>>> dictionaries/en_wordlist.combined.gz
Header :
date : 1374634568 <=> 1374721663
Body :
Added: Caltrain 30
>>> dictionaries/es_wordlist.combined.gz
Header :
date : 1372393817 <=> 1374721654
version : 35 <=> 36
Body :
Added: Caltrain 10
>>> java/res/raw/main_en.dict
Header :
date : 1374634568 <=> 1374721663
Body :
Added: Caltrain 30
>>> java/res/raw/main_es.dict
Header :
date : 1372393817 <=> 1374721654
version : 35 <=> 36
Body :
Added: Caltrain 10
Bug: 9995706
Change-Id: Icf96bf01e45ef94d3ffd6d6a9d6431c52f0f5a86
2013-07-25 12:48:55 +09:00
Jean Chalard
ffe7dbbe7a
Update dictionaries
...
>>> dictionaries/cs_wordlist.combined.gz
Header :
date : 1355802831 <=> 1372393817
version : 29 <=> 35
Body :
Added: LTE 25
>>> dictionaries/de_wordlist.combined.gz
Header :
date : 1355802835 <=> 1372393817
version : 29 <=> 35
Body :
Added: LTE 25
>>> dictionaries/en_GB_wordlist.combined.gz
Header :
date : 1366272052 <=> 1372393817
version : 31 <=> 35
Body :
Deleted: Sea 126
Added: LTE 25
>>> dictionaries/en_US_wordlist.combined.gz
Header :
date : 1366272093 <=> 1372393817
version : 31 <=> 35
Body :
Added: LTE 25
>>> dictionaries/en_wordlist.combined.gz
Header :
date : 1366272977 <=> 1372393837
version : 31 <=> 35
Body :
Deleted: Sea 126
Added: LTE 25
>>> dictionaries/es_wordlist.combined.gz
Header :
date : 1355802832 <=> 1372393817
version : 29 <=> 35
Body :
Added: LTE 25
>>> dictionaries/fr_wordlist.combined.gz
Header :
date : 1366272255 <=> 1372393818
version : 31 <=> 35
Body :
Deleted: R'n'B 95
Deleted: count 60
Deleted: d'Inti 34
Added: beurk 25
>>> dictionaries/hr_wordlist.combined.gz
Header :
date : 1355802836 <=> 1372393818
version : 29 <=> 35
Body :
Added: LTE 25
>>> dictionaries/it_wordlist.combined.gz
Header :
date : 1355802836 <=> 1372393818
version : 29 <=> 35
Body :
Added: LTE 25
>>> dictionaries/lt_wordlist.combined.gz
Header :
date : 1355802843 <=> 1372393818
version : 29 <=> 35
Body :
Added: LTE 25
>>> dictionaries/lv_wordlist.combined.gz
Header :
date : 1355802843 <=> 1372393818
version : 29 <=> 35
Body :
Added: LTE 25
>>> dictionaries/nb_wordlist.combined.gz
Header :
date : 1366003450 <=> 1372393818
version : 31 <=> 35
Body :
Added: LTE 25
>>> dictionaries/nl_wordlist.combined.gz
Header :
date : 1355802844 <=> 1372393818
version : 29 <=> 35
Body :
Added: LTE 25
>>> dictionaries/ru_wordlist.combined.gz
Header :
date : 1370244430 <=> 1372393835
version : 34 <=> 35
Body :
Freq changed: связывание 93 -> 0
>>> dictionaries/sl_wordlist.combined.gz
Header :
date : 1355802835 <=> 1372393835
version : 29 <=> 35
Body :
Added: LTE 25
>>> dictionaries/sr_wordlist.combined.gz
Header :
date : 1355802853 <=> 1372393835
version : 29 <=> 35
Body :
Added: LTE 25
>>> dictionaries/sv_wordlist.combined.gz
Header :
date : 1366003804 <=> 1372393836
version : 31 <=> 35
Body :
Added: LTE 25
>>> dictionaries/tr_wordlist.combined.gz
Header :
date : 1355802858 <=> 1372393837
version : 29 <=> 35
Body :
Added: LTE 25
>>> java/res/raw/main_de.dict
Header :
date : 1355802835 <=> 1372393817
version : 29 <=> 35
Body :
Added: LTE 25
>>> java/res/raw/main_en.dict
Header :
date : 1366272977 <=> 1372393837
version : 31 <=> 35
Body :
Deleted: Sea 126
Added: LTE 25
>>> java/res/raw/main_es.dict
Header :
date : 1355802832 <=> 1372393817
version : 29 <=> 35
Body :
Added: LTE 25
>>> java/res/raw/main_fr.dict
Header :
date : 1366272255 <=> 1372393818
version : 31 <=> 35
Body :
Deleted: R'n'B 95
Deleted: count 60
Deleted: d'Inti 34
Added: beurk 25
>>> java/res/raw/main_it.dict
Header :
date : 1355802836 <=> 1372393818
version : 29 <=> 35
Body :
Added: LTE 25
>>> java/res/raw/main_ru.dict
Header :
date : 1370244430 <=> 1372393835
version : 34 <=> 35
Body :
Freq changed: связывание 93 -> 0
Bug: 9301610
Bug: 9607966
Change-Id: I1117ed85d97fbb0ee50f11bc31776f1970b56f12
2013-06-28 14:54:51 +09:00
Jean Chalard
21dbe3701c
Update dictionaries
...
cs, da, de, el, es, fi, fr, hr, it, lt, lv, nb, nl, pl,
pt_BR, pt_PT, sl, sr, sv, tr : rescale frequencies to match
spec. This has no large effect in practice, except that the
dictionary becomes stronger relative to the spatial model
(especially for lower-count corpora such as lt, lv, sr)
en* : Small changes (rounding going the other way essentially)
ru : the above rescaling, and remove the following words:
Дре, ОСТа, Планше, легкими, легком, легкому, легкости,
легкую, нелегкие, нелегкий, нелегким, нелегкое, нелегкой,
нелегкую, полулегком and add нелёгкие, нелёгкое, нелёгкую;
other accented forms were already in the dictionary.
Change-Id: I40386c2ebd4d2be38874e822bde89db7cb512ae6
2012-12-18 13:06:48 +09:00
Jean Chalard
d080986f93
Update dictionaries
...
>>> dictionaries/en_GB_wordlist.combined.gz
Header :
date : 1354870724 <=> 1355112440
version : 27 <=> 28
Body :
Deleted: DoCoMo 65
Added: Docomo 65
Added: KDDI 25
Added: Softbank 25
>>> dictionaries/en_US_wordlist.combined.gz
Header :
date : 1354870736 <=> 1355112451
version : 27 <=> 28
Body :
Deleted: DoCoMo 65
Added: Docomo 65
Added: KDDI 25
Added: Softbank 25
>>> dictionaries/en_wordlist.combined.gz
Header :
date : 1354870744 <=> 1355112460
version : 27 <=> 28
Body :
Deleted: DoCoMo 65
Added: Docomo 65
Added: KDDI 25
Added: Softbank 25
>>> dictionaries/es_wordlist.combined.gz
Header :
date : 1351676002 <=> 1355117676
version : 26 <=> 28
Body :
Deleted: DoCoMo 40
Added: Docomo 40
Added: KDDI 25
Added: Softbank 25
>>> dictionaries/fi_wordlist.combined.gz
Header :
date : 1351676054 <=> 1355117691
version : 26 <=> 28
Body :
Deleted: DoCoMo 28
Added: Docomo 28
Added: KDDI 25
Added: Softbank 25
>>> dictionaries/fr_wordlist.combined.gz
Header :
date : 1354872988 <=> 1355117708
version : 27 <=> 28
Body :
Deleted: DoCoMo 52
Added: Docomo 52
Added: KDDI 25
Added: Softbank 25
>>> dictionaries/pt_PT_wordlist.combined.gz
Header :
date : 1351676510 <=> 1355117723
version : 26 <=> 28
Body :
Deleted: DoCoMo 48
Added: Docomo 48
Added: Softbank 25
>>> java/res/raw/main_en.dict
Header :
date : 1354870744 <=> 1355112460
version : 27 <=> 28
Body :
Deleted: DoCoMo 65
Added: Docomo 65
Added: KDDI 25
Added: Softbank 25
>>> java/res/raw/main_es.dict
Header :
date : 1353500806 <=> 1355117676
version : 27 <=> 28
Body :
Deleted: DoCoMo 40
Added: Docomo 40
Added: KDDI 25
Added: Softbank 25
>>> java/res/raw/main_fr.dict
Header :
date : 1354872988 <=> 1355117708
version : 27 <=> 28
Body :
Deleted: DoCoMo 52
Added: Docomo 52
Added: KDDI 25
Added: Softbank 25
Change-Id: I3801cbe4535407f55ede8db327674d493a92d1ae
2012-12-10 14:52:43 +09:00
Jean Chalard
a424ff06ec
Switch the AOSP word lists to the combined format.
...
This will help with managing the word lists.
Bug: 7388859
Change-Id: I89f049569b177d3027fe56d6c67eaca27d44dc7d
2012-10-31 18:52:00 +09:00