Commit Graph

36940 Commits (32e00798776c008862f2138e4f8f1320e9a26b16)

Author SHA1 Message Date
Keisuke Kuroyanagi ef931546a0 Merge "Add hacks for better handling count value during migration." 2014-10-31 13:53:57 +00:00
Keisuke Kuroyanagi 60826f008f am a88c9682: Merge "Change v403 historical info format."
* commit 'a88c9682fc97863d27b46932adc1218990e70bd7':
  Change v403 historical info format.
2014-10-31 13:42:14 +00:00
Keisuke Kuroyanagi a88c9682fc Merge "Change v403 historical info format." 2014-10-31 13:38:38 +00:00
Keisuke Kuroyanagi ddda03a625 am 3cde19de: Merge "Initial commit for native dicttoolkit."
* commit '3cde19ded17637e96a9820862d015a4682b8e3f7':
  Initial commit for native dicttoolkit.
2014-10-31 11:32:59 +00:00
Keisuke Kuroyanagi 3cde19ded1 Merge "Initial commit for native dicttoolkit." 2014-10-31 11:29:20 +00:00
Keisuke Kuroyanagi e101a53ffc Initial commit for native dicttoolkit.
Bug: 10059681

Change-Id: Ib730af8ebc944e08aaada869c0626724a499747c
2014-10-31 20:27:06 +09:00
Tadashi G. Takaoka 8c980ff45d am ef8068f9: Merge "Add helper methods to convert flags to readable text"
* commit 'ef8068f9a85af5c402588662b3a7e98076e4c0dd':
  Add helper methods to convert flags to readable text
2014-10-31 10:16:32 +00:00
Tadashi G. Takaoka ef8068f9a8 Merge "Add helper methods to convert flags to readable text" 2014-10-31 10:12:37 +00:00
Adrian Velicu 9367ec5f76 am 372ca14d: Merge "Further fixes to treat 0-frequency words"
* commit '372ca14deafbc12ccd34004a8779a9d24ff1dcf8':
  Further fixes to treat 0-frequency words
2014-10-31 09:29:01 +00:00
Adrian Velicu 372ca14dea Merge "Further fixes to treat 0-frequency words" 2014-10-31 09:26:02 +00:00
Jean Chalard 34ecd94f6d am 652b655e: Remove some dicttool dependencies
* commit '652b655eab485682b9600867b6b9bc860387e7e4':
  Remove some dicttool dependencies
2014-10-31 08:59:26 +00:00
Jean Chalard 652b655eab Remove some dicttool dependencies
Bug: 18108776
Change-Id: Ic81c5df2a685160c249021ff4cfe41553dbdeebe
2014-10-31 17:52:46 +09:00
Keisuke Kuroyanagi ea468cc9de Update v4 format version from 402 to 403.
Without personalization:
Total words: 1134774, Success Num: 899230, Success Percentage: 79.243%
Bad Failures, with auto-correction (typed word == expected word, output word != expected word): 1871, Bad Failure Percentage: 0.165%
Failures, with auto-correction (F-C): 29084, F-C Percentage: 2.563%
Max Keystrokes: 6072959, Min Keystrokes: 4436090, Keystroke Saving Percentage: 26.953%

Before:
Total words: 1134646, Success Num: 925194, Success Percentage: 81.540%
Bad Failures, with auto-correction (typed word == expected word, output word != expected word): 1316, Bad Failure Percentage: 0.116%
Failures, with auto-correction (F-C): 28288, F-C Percentage: 2.493%
Max Keystrokes: 6072831, Min Keystrokes: 3946188, Keystroke Saving Percentage: 35.019%

After:
Total words: 1134659, Success Num: 944746, Success Percentage: 83.263%
Bad Failures, with auto-correction (typed word == expected word, output word != expected word): 1258, Bad Failure Percentage: 0.111%
Failures, with auto-correction (F-C): 28016, F-C Percentage: 2.469%
Max Keystrokes: 6072844, Min Keystrokes: 3387333, Keystroke Saving Percentage: 44.222%

Change-Id: I3af42ec37a11847c0429c28616e726f6a339247f
2014-10-31 17:23:39 +09:00
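Note on the figures in ea468cc9de above: the reported keystroke-saving percentages appear to follow directly from the raw counts. The following is a minimal Python sketch, assuming the saving is computed as (max - min) / max. The commit message does not state the formula, and the function names here are hypothetical.

```python
def percentage(part, whole):
    """Return part/whole as a percentage rounded to three decimals."""
    return round(100.0 * part / whole, 3)


def keystroke_saving(max_keystrokes, min_keystrokes):
    # Assumed formula (not stated in the commit message): the share of
    # keystrokes saved relative to typing every character.
    return percentage(max_keystrokes - min_keystrokes, max_keystrokes)


# Raw (max, min) keystroke counts copied from the commit message.
runs = {
    "Without personalization": (6072959, 4436090),
    "Before": (6072831, 3946188),
    "After": (6072844, 3387333),
}

for label, (max_ks, min_ks) in runs.items():
    print(label, keystroke_saving(max_ks, min_ks))
```

Under this assumption the three runs reproduce the reported 26.953%, 35.019%, and 44.222%.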
Keisuke Kuroyanagi c611989929 Add hacks for better handling count value during migration.
Bug: 14425059
Change-Id: Ib050574aa7c4babd4285322a11c3af9be9fbab1e
2014-10-31 17:22:13 +09:00
Keisuke Kuroyanagi 2383575d2d Change v403 historical info format.
count -> 2B, level -> 0B.

Change-Id: I3b241126f56eb33cdf09cb1ebfed04f534e4ec48
2014-10-31 17:22:13 +09:00
Jean Chalard a7281a766c am 5840a7c6: Merge "Remove Dict dependency on WordComposer and ProximityInfo"
* commit '5840a7c6e7c6b5eea3269457affaf063fe712944':
  Remove Dict dependency on WordComposer and ProximityInfo
2014-10-31 08:03:38 +00:00
Jean Chalard 5840a7c6e7 Merge "Remove Dict dependency on WordComposer and ProximityInfo" 2014-10-31 07:59:12 +00:00
Adrian Velicu a1ba8ae1bc am 434c08a7: Merge "Block offensive words in multi-word suggestions"
* commit '434c08a79ec60347590e26725372834678a8aa84':
  Block offensive words in multi-word suggestions
2014-10-31 07:35:23 +00:00
Adrian Velicu 434c08a79e Merge "Block offensive words in multi-word suggestions" 2014-10-31 07:31:25 +00:00
Tadashi G. Takaoka 9dc3ae84ee Add helper methods to convert flags to readable text
Change-Id: I23608927cef8a897ff4ad8c938560cc78093ddac
2014-10-31 16:20:20 +09:00
Ken Wakasa ae9bfdc061 am ed575b62: Merge "Add documentation for CollectionUtils#arrayAsList"
* commit 'ed575b62da197e81ace23eeb1af09fd776fa0b41':
  Add documentation for CollectionUtils#arrayAsList
2014-10-31 07:09:52 +00:00
Ken Wakasa ed575b62da Merge "Add documentation for CollectionUtils#arrayAsList" 2014-10-31 07:07:07 +00:00
Adrian Velicu 009e02ce4a Further fixes to treat 0-frequency words
Previously, when both legitimate 0-frequency words (such as
distracters) and offensive words were encoded in the same
way, distracters would never show up when the user blocked
offensive words (the default setting, as well as the setting
for regression tests).

When b/11031090 was fixed and a separate encoding was used
for offensive words, 0-frequency words would no longer be
blocked when they were an "exact match" (where case
mismatches and accent mismatches would be considered an
"exact match"). The exact match boosting functionality meant
that, for example, when the user typed "mt" they would be
suggested the word "Mt", although they most probably meant
to type "my".

For this reason, we introduced this change, which does the
following:
* Defines the "perfect match" as a really exact match, with
no room for case or accent mismatches
* When the target word has probability zero (as "Mt" does,
because it is a distracter), ONLY boost its score if it is a
perfect match.

By doing this, when the user types "mt", the word "Mt" will
NOT be boosted, and they will get "my". However, if the user
makes an explicit effort to type "Mt", we do boost the word
"Mt" so that the user's input is not autocorrected to "My".

Bug: 11031090
Change-Id: I92ee1b4e742645d52e2f7f8c4390920481e8fff0
2014-10-31 15:58:50 +09:00
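Note on 009e02ce4a above: the boosting rule described in that message can be sketched as follows. This is an illustrative Python sketch only, not the project's native implementation. All function names are hypothetical, and the accent folding via Unicode decomposition is an assumption about what "accent mismatch" means.

```python
import unicodedata


def _fold(word):
    """Drop combining accents and case so near-identical words compare equal."""
    decomposed = unicodedata.normalize("NFD", word)
    stripped = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    return stripped.casefold()


def is_exact_match(typed, candidate):
    # "Exact match" in the message tolerates case and accent mismatches.
    return _fold(typed) == _fold(candidate)


def is_perfect_match(typed, candidate):
    # "Perfect match" leaves no room for case or accent mismatches.
    return typed == candidate


def should_boost(typed, candidate, probability):
    """Decide whether the candidate gets the match boost."""
    if probability == 0:
        # Zero-probability words (e.g. distracters) need a perfect match.
        return is_perfect_match(typed, candidate)
    return is_exact_match(typed, candidate)
```

With these definitions, should_boost("mt", "Mt", 0) is false, so "my" can win, while should_boost("Mt", "Mt", 0) is true, so deliberately typed input is kept.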
Adrian Velicu 10416241f7 Block offensive words in multi-word suggestions
If the user has chosen to block offensive words and types
"aaaxbb", where "aaa" is an offensive word and "bb" is not,
we should not suggest "aaa bb".

Bug: 11031090
Change-Id: Ie23b8dd5d347bc26b1c046c3f5e8dfbc259bf528
2014-10-31 15:58:50 +09:00
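Note on 10416241f7 above: the rule for multi-word suggestions can be sketched as a simple filter. This is a hypothetical Python sketch. The function name, the set-based lookup, and the parameter names are assumptions made for illustration and do not reflect how the dictionary stores the possibly_offensive flag.

```python
def allow_multi_word_suggestion(words, possibly_offensive, block_offensive):
    """Return True if a multi-word suggestion may be shown to the user."""
    if not block_offensive:
        # The user has not asked for offensive words to be blocked.
        return True
    # Reject the whole suggestion if any single component is flagged.
    return not any(word in possibly_offensive for word in words)


# "aaa" stands in for an offensive word, as in the commit message.
flagged = {"aaa"}
print(allow_multi_word_suggestion(["aaa", "bb"], flagged, block_offensive=True))
print(allow_multi_word_suggestion(["cc", "bb"], flagged, block_offensive=True))
```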
Adrian Velicu dca729bd21 am 61d43e5c: Merge "Update unit tests for new static dict version."
* commit '61d43e5c941e7a76d614ffbe756137c02a34cdc1':
  Update unit tests for new static dict version.
2014-10-31 06:55:23 +00:00
Adrian Velicu b524aa768c am f59c3b62: Merge "Update dictionaries (possibly_offensive flag)"
* commit 'f59c3b62383711a58366bb77caefb62dced3f116':
  Update dictionaries (possibly_offensive flag)
2014-10-31 06:55:21 +00:00
Adrian Velicu 93a9b836f9 am aa20342d: Merge "Using "blacklist" flag as "possibly offensive""
* commit 'aa20342d7ee40fb13483fc2a663493de185a33cd':
  Using "blacklist" flag as "possibly offensive"
2014-10-31 06:55:19 +00:00
Adrian Velicu 61d43e5c94 Merge "Update unit tests for new static dict version." 2014-10-31 06:49:51 +00:00
Adrian Velicu f59c3b6238 Merge "Update dictionaries (possibly_offensive flag)" 2014-10-31 06:49:42 +00:00
Adrian Velicu aa20342d7e Merge "Using "blacklist" flag as "possibly offensive"" 2014-10-31 06:49:29 +00:00
David Faden d443146c34 Add documentation for CollectionUtils#arrayAsList
(This is a cherrypick of http://ag/576739)

Also, modify the IllegalArgumentException to capture the
values passed for start, end, and array.length to make
debugging easier in the unlikely event that it is thrown.
Add corresponding tests for the exceptions.

Tested:
tapas LatinImeGoogle LatinImeGoogleTests userdebug
make -j30 && \
adb install -r \
out/target/product/generic/data/app/LatinImeGoogleTests/LatinImeGoogleTests.apk && \
adb install -r \
out/target/product/generic/system/app/LatinImeGoogle/LatinImeGoogle.apk && \
adb shell am instrument -w -e class \
com.android.inputmethod.latin.utils.CollectionUtilsTests \
com.google.android.inputmethod.latin.tests/android.support.test.runner.AndroidJUnitRunner

Change-Id: I0c579ca4f4f2a9b2575aa6f8fa75e3ee4a21cf8a
2014-10-31 06:18:35 +00:00
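Note on d443146c34 above: the idea of capturing start, end, and array.length in the exception can be illustrated with a Python analogue. The actual CollectionUtils#arrayAsList is Java and throws IllegalArgumentException. The function below, its bounds check, and its message format are assumptions made for illustration.

```python
def array_as_list(array, start, end):
    """Return the elements array[start:end] as a tuple."""
    if start < 0 or start > end or end > len(array):
        # Include every input in the message so a failure can be
        # diagnosed from the log alone.
        raise ValueError(
            f"Invalid start: {start} end: {end} "
            f"with array.length: {len(array)}"
        )
    return tuple(array[start:end])


try:
    array_as_list([1, 2, 3], 2, 5)
except ValueError as error:
    print(error)
```

A matching test would assert both the normal result and that the raised message carries all three values, in line with the "corresponding tests for the exceptions" mentioned in the commit.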
Adrian Velicu 88480f39f6 Update unit tests for new static dict version.
Bug: 11031090
Change-Id: Ifd7b241f1a0a30678ca7de97cc6e96927a9b5c8f
2014-10-31 14:50:18 +09:00
Adrian Velicu 8dd31a28ae Update dictionaries (possibly_offensive flag)
Encode possibly offensive words with their actual frequency and
the possibly_offensive flag set.

Zero frequency is still used, but only for distracters and
words that should never come up.

https://paste.googleplex.com/5167060875214848

Bug: 11031090
Change-Id: Ia394b1827f292ff8d4791cc2f3e6e50b5aff4cbe
2014-10-31 14:49:24 +09:00
Adrian Velicu 7c87859d4c Using "blacklist" flag as "possibly offensive"
Bug: 11031090
Change-Id: I5cc0d006ab003656498eb82b0875eb9c051d331e
2014-10-31 14:33:05 +09:00
Tadashi G. Takaoka 8523720b22 Remove deprecated Canvas.clipRegion(Region) call
Change-Id: I88e0a8d327c17bf4bafbe44cdfa265f5d0acb879
2014-10-31 14:06:30 +09:00
Keisuke Kuroyanagi cc9dd07aa3 am 0cd1f222: Fix: native unit test build.
* commit '0cd1f222fd837179f501651256bc15b42317edd0':
  Fix: native unit test build.
2014-10-31 03:55:53 +00:00
Keisuke Kuroyanagi 0cd1f222fd Fix: native unit test build.
Change-Id: Id2bd4b60d6a4023815a630ebb3059a435b72c193
2014-10-31 12:50:45 +09:00
Keisuke Kuroyanagi 2bb77ebf13 am c096100b: Merge "Enable count based dynamic ngram language model for v403."
* commit 'c096100b0191797834ce84e140560c0fadc8b264':
  Enable count based dynamic ngram language model for v403.
2014-10-31 03:19:50 +00:00
Keisuke Kuroyanagi c096100b01 Merge "Enable count based dynamic ngram language model for v403." 2014-10-31 03:15:20 +00:00
Baligh Uddin b2039bcd7b Import translations. DO NOT MERGE
Change-Id: I7e725d123380b47feb9f3442cf5ac4299997d1a9
Auto-generated-cl: translation import
2014-10-30 11:06:09 -07:00
Baligh Uddin f6a4367279 Import translations. DO NOT MERGE
Change-Id: I7db7b88569ffbe44449f97950cb337ea2f3b1761
Auto-generated-cl: translation import
2014-10-30 11:04:58 -07:00
Baligh Uddin d107ca5b4e Import translations. DO NOT MERGE
Change-Id: I58e9e2b6fbeea9369903208642f548dcd60db631
Auto-generated-cl: translation import
2014-10-30 11:01:30 -07:00
Baligh Uddin de4da90db2 Import translations. DO NOT MERGE
Change-Id: I2c081d71d715a47d2baabd8069d6cdf28ca03cdc
Auto-generated-cl: translation import
2014-10-30 11:00:41 -07:00
Baligh Uddin 34bca016c7 Import translations. DO NOT MERGE
Change-Id: I6427702f2022b2423ef102b00edff0a7ffe5122a
Auto-generated-cl: translation import
2014-10-30 11:00:06 -07:00
Baligh Uddin 980c86dcff Import translations. DO NOT MERGE
Change-Id: I5a18946cc2b568f649028a1bc50a4f78e6aad0af
Auto-generated-cl: translation import
2014-10-30 10:59:28 -07:00
Baligh Uddin bfe51d5938 Import translations. DO NOT MERGE
Change-Id: I19b5c3a4956bbf3e22566cce7f74791a98869727
Auto-generated-cl: translation import
2014-10-30 10:58:15 -07:00
Keisuke Kuroyanagi bcb52d73e2 Enable count based dynamic ngram language model for v403.
Bug: 14425059

Change-Id: Icc15e14cfd77d37cd75f75318fd0fa36f9ca7a5b
2014-10-30 23:38:19 +09:00
Keisuke Kuroyanagi d179f62cdb am 04a492cb: Merge "Add DynamicLanguageModelProbabilityUtils."
* commit '04a492cb490e2dc38214f5cfa99283a886e6acbb':
  Add DynamicLanguageModelProbabilityUtils.
2014-10-30 14:10:57 +00:00
Keisuke Kuroyanagi 04a492cb49 Merge "Add DynamicLanguageModelProbabilityUtils." 2014-10-30 14:08:17 +00:00
Keisuke Kuroyanagi 660b00477c Add DynamicLanguageModelProbabilityUtils.
Bug: 14425059
Change-Id: Ia58ab3f0ead02798046d182a9464dcbd95f086bc
2014-10-30 21:33:57 +09:00