Commit graph

33746 commits

Author SHA1 Message Date
Keisuke Kuroyanagi
2cf5550749 Fix: BoS prediction after inputting just once.
Change-Id: Ib69569ab6b6edfcc8c1d2c621b95de4127789ab6
2014-11-01 17:58:22 +09:00
Baligh Uddin
2af3fc3275 am 25b6185d: (-s ours) am 19eefc51: (-s ours) Merge "Import translations. DO NOT MERGE" into stage-aosp-master
* commit '25b6185d1cd26f91ee417906714fa6ceb9fb8705':
  Import translations. DO NOT MERGE
2014-10-31 17:16:14 +00:00
Baligh Uddin
28b90dce0b am 971297e3: (-s ours) am 084de3c9: (-s ours) Merge "Import translations. DO NOT MERGE" into stage-aosp-master
* commit '971297e38d02b6ea236e08f28c7d5cb3ee44e33f':
  Import translations. DO NOT MERGE
2014-10-31 17:16:12 +00:00
Baligh Uddin
d064c64481 am 929e97e4: (-s ours) am 46905cfd: (-s ours) Merge "Import translations. DO NOT MERGE" into stage-aosp-master
* commit '929e97e40bd74a6de8ed619e145a332ab1134e81':
  Import translations. DO NOT MERGE
2014-10-31 17:16:11 +00:00
Baligh Uddin
753c65f067 am ed8b51a7: (-s ours) am 46315a3e: (-s ours) Merge "Import translations. DO NOT MERGE" into stage-aosp-master
* commit 'ed8b51a7fbe02c5534d1855d3075eb4792a9cb84':
  Import translations. DO NOT MERGE
2014-10-31 17:16:09 +00:00
Baligh Uddin
cab2a02aa7 am 7fe0ea42: (-s ours) am f8de10ce: (-s ours) Merge "Import translations. DO NOT MERGE" into stage-aosp-master
* commit '7fe0ea4218a2ca1c663076d8e52ffeb7d842ecc4':
  Import translations. DO NOT MERGE
2014-10-31 17:16:07 +00:00
Baligh Uddin
dba15396b8 am a42c4a83: (-s ours) am 1d379712: (-s ours) Merge "Import translations. DO NOT MERGE" into stage-aosp-master
* commit 'a42c4a83664c47a6ddd081d9bb7b2fdb3c07bd9e':
  Import translations. DO NOT MERGE
2014-10-31 17:16:06 +00:00
Baligh Uddin
c31b9528ff am b6b0b609: (-s ours) am b2039bcd: (-s ours) Import translations. DO NOT MERGE
* commit 'b6b0b609a49ea81bb280e04b117915dc0dc13799':
  Import translations. DO NOT MERGE
2014-10-31 17:16:04 +00:00
Baligh Uddin
25b6185d1c am 19eefc51: (-s ours) Merge "Import translations. DO NOT MERGE" into stage-aosp-master
* commit '19eefc5109b65b1ecf74190904a1c1aa3b30b640':
  Import translations. DO NOT MERGE
2014-10-31 17:07:51 +00:00
Baligh Uddin
971297e38d am 084de3c9: (-s ours) Merge "Import translations. DO NOT MERGE" into stage-aosp-master
* commit '084de3c970cb10b02be6844d3a78ad69b487ff7c':
  Import translations. DO NOT MERGE
2014-10-31 17:07:49 +00:00
Baligh Uddin
929e97e40b am 46905cfd: (-s ours) Merge "Import translations. DO NOT MERGE" into stage-aosp-master
* commit '46905cfdec1c3f69cd88025d4e62ebcd078937d3':
  Import translations. DO NOT MERGE
2014-10-31 17:07:46 +00:00
Baligh Uddin
ed8b51a7fb am 46315a3e: (-s ours) Merge "Import translations. DO NOT MERGE" into stage-aosp-master
* commit '46315a3e2d7143781902fed103df28d3964109bb':
  Import translations. DO NOT MERGE
2014-10-31 17:07:44 +00:00
Baligh Uddin
7fe0ea4218 am f8de10ce: (-s ours) Merge "Import translations. DO NOT MERGE" into stage-aosp-master
* commit 'f8de10ce2163343ab4b39b762bb1e6b5a4449069':
  Import translations. DO NOT MERGE
2014-10-31 17:07:42 +00:00
Baligh Uddin
a42c4a8366 am 1d379712: (-s ours) Merge "Import translations. DO NOT MERGE" into stage-aosp-master
* commit '1d379712e5e7fae305d1f24665b711050b1e510c':
  Import translations. DO NOT MERGE
2014-10-31 17:07:40 +00:00
Baligh Uddin
b6b0b609a4 am b2039bcd: (-s ours) Import translations. DO NOT MERGE
* commit 'b2039bcd7bad88b4cbc2e113a4147a5545e3c106':
  Import translations. DO NOT MERGE
2014-10-31 17:07:38 +00:00
Baligh Uddin
19eefc5109 Merge "Import translations. DO NOT MERGE" into stage-aosp-master 2014-10-31 17:02:08 +00:00
Baligh Uddin
084de3c970 Merge "Import translations. DO NOT MERGE" into stage-aosp-master 2014-10-31 17:02:05 +00:00
Baligh Uddin
46905cfdec Merge "Import translations. DO NOT MERGE" into stage-aosp-master 2014-10-31 17:02:03 +00:00
Baligh Uddin
46315a3e2d Merge "Import translations. DO NOT MERGE" into stage-aosp-master 2014-10-31 17:02:00 +00:00
Baligh Uddin
f8de10ce21 Merge "Import translations. DO NOT MERGE" into stage-aosp-master 2014-10-31 17:01:57 +00:00
Baligh Uddin
1d379712e5 Merge "Import translations. DO NOT MERGE" into stage-aosp-master 2014-10-31 17:01:55 +00:00
Keisuke Kuroyanagi
b3bae2e89b Merge "Update v4 format version from 402 to 403." 2014-10-31 14:19:44 +00:00
Keisuke Kuroyanagi
ef931546a0 Merge "Add hacks for better handling count value during migration." 2014-10-31 13:53:57 +00:00
Keisuke Kuroyanagi
a88c9682fc Merge "Change v403 historical info format." 2014-10-31 13:38:38 +00:00
Keisuke Kuroyanagi
3cde19ded1 Merge "Initial commit for native dicttoolkit." 2014-10-31 11:29:20 +00:00
Keisuke Kuroyanagi
e101a53ffc Initial commit for native dicttoolkit.
Bug: 10059681

Change-Id: Ib730af8ebc944e08aaada869c0626724a499747c
2014-10-31 20:27:06 +09:00
Tadashi G. Takaoka
ef8068f9a8 Merge "Add helper methods to convert flags to readable text" 2014-10-31 10:12:37 +00:00
Adrian Velicu
372ca14dea Merge "Further fixes to treat 0-frequency words" 2014-10-31 09:26:02 +00:00
Jean Chalard
652b655eab Remove some dicttool dependencies
Bug: 18108776
Change-Id: Ic81c5df2a685160c249021ff4cfe41553dbdeebe
2014-10-31 17:52:46 +09:00
Keisuke Kuroyanagi
ea468cc9de Update v4 format version from 402 to 403.
Without personalization:
Total words: 1134774, Success Num: 899230, Success Percentage: 79.243%
Bad Failures, with auto-correction (typed word == expected word, output word != expected word): 1871, Bad Failure Percentage: 0.165%
Failures, with auto-correction (F-C): 29084, F-C Percentage: 2.563%
Max Keystrokes: 6072959, Min Keystrokes: 4436090, Keystroke Saving Percentage:26.953%

Before:
Total words: 1134646, Success Num: 925194, Success Percentage: 81.540%
Bad Failures, with auto-correction (typed word == expected word, output word != expected word): 1316, Bad Failure Percentage: 0.116%
Failures, with auto-correction (F-C): 28288, F-C Percentage: 2.493%
Max Keystrokes: 6072831, Min Keystrokes: 3946188, Keystroke Saving Percentage:35.019%

After
Total words: 1134659, Success Num: 944746, Success Percentage: 83.263%
Bad Failures, with auto-correction (typed word == expected word, output word != expected word): 1258, Bad Failure Percentage: 0.111%
Failures, with auto-correction (F-C): 28016, F-C Percentage: 2.469%
Max Keystrokes: 6072844, Min Keystrokes: 3387333, Keystroke Saving Percentage:44.222%

Change-Id: I3af42ec37a11847c0429c28616e726f6a339247f
2014-10-31 17:23:39 +09:00
Keisuke Kuroyanagi
c611989929 Add hacks for better handling count value during migration.
Bug: 14425059
Change-Id: Ib050574aa7c4babd4285322a11c3af9be9fbab1e
2014-10-31 17:22:13 +09:00
Keisuke Kuroyanagi
2383575d2d Change v403 historical info format.
count -> 2B, level -> 0B.

Change-Id: I3b241126f56eb33cdf09cb1ebfed04f534e4ec48
2014-10-31 17:22:13 +09:00
Jean Chalard
5840a7c6e7 Merge "Remove Dict dependency on WordComposer and ProximityInfo" 2014-10-31 07:59:12 +00:00
Adrian Velicu
434c08a79e Merge "Block offensive words in multi-word suggestions" 2014-10-31 07:31:25 +00:00
Tadashi G. Takaoka
9dc3ae84ee Add helper methods to convert flags to readable text
Change-Id: I23608927cef8a897ff4ad8c938560cc78093ddac
2014-10-31 16:20:20 +09:00
Ken Wakasa
ed575b62da Merge "Add documentation for CollectionUtils#arrayAsList" 2014-10-31 07:07:07 +00:00
Adrian Velicu
009e02ce4a Further fixes to treat 0-frequency words
Previously, when both legitimate 0-frequency words (such as
distracters) and offensive words were encoded in the same
way, distracters would never show up when the user blocked
offensive words (the default setting, as well as the setting
for regression tests).

When b/11031090 was fixed and a separate encoding was used
for offensive words, 0-frequency words would no longer be
blocked when they were an "exact match" (where case
mismatches and accent mismatches would be considered an
"exact match"). The exact match boosting functionality meant
that, for example, when the user typed "mt" they would be
suggested the word "Mt", although they most probably meant
to type "my".

For this reason, we introduced this change, which does the
following:
* Defines the "perfect match" as a really exact match, with
no room for case or accent mismatches
* When the target word has probability zero (as "Mt" does,
because it is a distracter), ONLY boost its score if it is a
perfect match.

By doing this, when the user types "mt", the word "Mt" will
NOT be boosted, and they will get "my". However, if the user
makes an explicit effort to type "Mt", we do boost the word
"Mt" so that the user's input is not autocorrected to "My".

Bug: 11031090
Change-Id: I92ee1b4e742645d52e2f7f8c4390920481e8fff0
2014-10-31 15:58:50 +09:00
Adrian Velicu
10416241f7 Block offensive words in multi-word suggestions
If the user has chosen to block offensive words and types
"aaaxbb", where "aaa" is an offensive word and "bb" is not,
we should not suggest "aaa bb".

Bug: 11031090
Change-Id: Ie23b8dd5d347bc26b1c046c3f5e8dfbc259bf528
2014-10-31 15:58:50 +09:00
Adrian Velicu
61d43e5c94 Merge "Update unit tests for new static dict version." 2014-10-31 06:49:51 +00:00
Adrian Velicu
f59c3b6238 Merge "Update dictionaries (possibly_offensive flag)" 2014-10-31 06:49:42 +00:00
Adrian Velicu
aa20342d7e Merge "Using "blacklist" flag as "possibly offensive"" 2014-10-31 06:49:29 +00:00
David Faden
d443146c34 Add documentation for CollectionUtils#arrayAsList
(This is a cherrypick of http://ag/576739)

Also, modify the IllegalArgumentException to capture the
values passed for start, end, and array.length to make
debugging easier in the unlikely event that it is thrown.
Add corresponding tests for the exceptions.

Tested:
tapas LatinImeGoogle LatinImeGoogleTests userdebug
make -j30 && \
adb install -r \
out/target/product/generic/data/app/LatinImeGoogleTests/LatinImeGoogleTests.apk && \
adb install -r \
out/target/product/generic/system/app/LatinImeGoogle/LatinImeGoogle.apk && \
adb shell am instrument -w -e class \
com.android.inputmethod.latin.utils.CollectionUtilsTests \
com.google.android.inputmethod.latin.tests/android.support.test.runner.AndroidJUnitRunner

Change-Id: I0c579ca4f4f2a9b2575aa6f8fa75e3ee4a21cf8a
2014-10-31 06:18:35 +00:00
Adrian Velicu
88480f39f6 Update unit tests for new static dict version.
Bug: 11031090
Change-Id: Ifd7b241f1a0a30678ca7de97cc6e96927a9b5c8f
2014-10-31 14:50:18 +09:00
Adrian Velicu
8dd31a28ae Update dictionaries (possibly_offensive flag)
Correctly encoding possibly offensive words with their correct
frequency and the possibly_offensive flag set.

Continuing to encode with zero frequency only distracters or
words that should never come up.

https://paste.googleplex.com/5167060875214848

Bug: 11031090
Change-Id: Ia394b1827f292ff8d4791cc2f3e6e50b5aff4cbe
2014-10-31 14:49:24 +09:00
Adrian Velicu
7c87859d4c Using "blacklist" flag as "possibly offensive"
Bug: 11031090
Change-Id: I5cc0d006ab003656498eb82b0875eb9c051d331e
2014-10-31 14:33:05 +09:00
Keisuke Kuroyanagi
0cd1f222fd Fix: native unit test build.
Change-Id: Id2bd4b60d6a4023815a630ebb3059a435b72c193
2014-10-31 12:50:45 +09:00
Keisuke Kuroyanagi
c096100b01 Merge "Enable count based dynamic ngram language model for v403." 2014-10-31 03:15:20 +00:00
Baligh Uddin
b2039bcd7b Import translations. DO NOT MERGE
Change-Id: I7e725d123380b47feb9f3442cf5ac4299997d1a9
Auto-generated-cl: translation import
2014-10-30 11:06:09 -07:00
Baligh Uddin
f6a4367279 Import translations. DO NOT MERGE
Change-Id: I7db7b88569ffbe44449f97950cb337ea2f3b1761
Auto-generated-cl: translation import
2014-10-30 11:04:58 -07:00
Baligh Uddin
d107ca5b4e Import translations. DO NOT MERGE
Change-Id: I58e9e2b6fbeea9369903208642f548dcd60db631
Auto-generated-cl: translation import
2014-10-30 11:01:30 -07:00