LatinIME

Author	SHA1	Message	Date
Jean Chalard	a787dba83b	Fix a bug with umlaut processing. Issue: 3275926 Change-Id: Ibcb00aaea3ff05ad59ad4e8e54dd3caab5ab9bca	2011-03-04 13:07:07 +09:00
Jean Chalard	c2bbc6a449	Use translation of fallback umlauts digraphs for German. For German : handle "ae", "oe" and "ue" to be alternate forms for umlaut-bearing versions of "a", "o" and "u". Issue: 3275926 Change-Id: I056c707cdacc464ceab63be56c016c7f8439196c	2011-03-03 11:52:23 +09:00
satok	8fbd552292	Add proximity info to native Bug: 3311719 Change-Id: Ie596304070e321ad23fb67a13bf05e2b6af1b54b	2011-02-23 23:04:00 +09:00
Jean Chalard	f5f834afcd	Rename variables with obscure names. The `snr' variable has a very obscure name. Rename it to `matchWeight'. Also, the `toLowerCase' function is error-prone, since it actually returns a lower case version of the BASE char, that is without diacritics. Hence, rename it to `toBaseLowerCase' and update variables with similar names. Change-Id: Ibdbe73018a33ee864db59a51d664c3b104d5fb3f	2011-02-22 16:43:19 +09:00
Jean Chalard	a5d5849701	Force autocorrection of matching words with different accents. When entering a word without accents the user expects the system to add accents automatically if there is no other matching word. This patch ensures the accented version is promoted accordingly and autocorrection really takes place. Issue: 3400015 Change-Id: I8cd3db5bf131ec6844b26abecc1ecbd1d6269df4	2011-02-22 15:27:06 +09:00
Tadashi G. Takaoka	887f11ee43	Remove next letters frequency handling Bug: 3428942 Change-Id: Id62f467ce4e50c60a56d59bf96770e799a4659e2	2011-02-17 13:59:41 +09:00
Jean Chalard	8dc754a411	Promote full matches with differing accents. Stop considering accented characters as different from their base character for proximity scoring. Also give a huge boost (basically overriding frequency) to a word fully matched with only differing accents. Bug: 2550587 Change-Id: I2da7a71229fb3868d9e4a53703ccf8caeb6fcf10	2011-01-27 17:29:24 +09:00
satok	fd16f1d2a3	Handle the last char correctly in excessive char correction algortihm. bug: 3278422 Change-Id: I651d3cb0130ab9834ed9d7a97f41360c6eaa9de1	2011-01-27 16:44:54 +09:00
satok	58c49b9132	Fix auto-correction threshold and promote full matched words Bug: 3374359 Bug: 3278422 "zbe" will be auto corrected to "be" by fixing s-line "teh" will be auto corrected to "the" by promotion of full matched words Change-Id: I314c632820e4e0b1501edeca60ada205d291451f	2011-01-27 12:53:13 +09:00
Ken Wakasa	e90b333017	Load main dic in native Follow up to Id57dce51 bug: 3219819 Change-Id: I00e11ef21d0252ffa88c12dffb9c55b0f2e19a66	2011-01-07 19:51:45 +09:00
satok	f7425bb15b	Supress overflow at mulitplying demotion rate Change-Id: I2003c5f88a5062b11e2f21522095bb94b1eb4efd	2011-01-05 16:43:17 +09:00
satok	61e2f85e3f	Add profiler for native dictionary code Change-Id: I2569756c9ef4fa677ae52f2ccfcb90d2115d129f	2011-01-05 15:47:29 +09:00
satok	54fe9e0e20	Suggest words with excessive chars out of proximity chars Bug: 3273807 Change-Id: Ib8f48e562bcf4c2aac0ad5cb46809fd5f539a322	2010-12-13 17:44:14 +09:00
satok	a3d78f606e	Suggest words with transposed chars Bug: 3193883 Change-Id: I884b669258bfc522bc04e14f22a7646164a4cac5	2010-12-10 18:34:23 +09:00
satok	e07baa6fab	Limit the suggestions with an excessive character by filtering proximity characters Change-Id: Iad26dad545f1a431aa0fa53f99198b27defd03a3 ug: 3269482	2010-12-10 00:47:37 +09:00
satok	aee09dc5fa	Fix a bug that We can't suggest words with missing space if one of the words starts with a capitalized character. Bug: 3268825 Change-Id: I0634a243ad1e45dd096b30824b463c366a2e7f0f	2010-12-09 21:41:26 +09:00
satok	662fe69ba2	Suggest words with missing space Bug: 3193883 Change-Id: I8d25f3e1d4db10be733d85edfa4f55a094feef80	2010-12-09 14:26:27 +09:00
satok	cdbbea735f	Suggest excessive characters bug: 3193883 Change-Id: Iea7a0fce7ce62d8779a7c7e4613d50db30d82b07	2010-12-08 16:56:06 +09:00
satok	d299792368	Make no-recursive getWordRec Change-Id: Id90f3ca86ef490834cefa92f0d6958b1289fc633	2010-12-07 16:45:32 +09:00
satok	f5cded1c6c	Fix a crash when MAX_WORD_LENGTH is too short. Change-Id: Idcb5aa2685321b8d0ac7d846caecbd1c79e4dd77	2010-12-06 22:58:56 +09:00
satok	48e432ceb8	Breakdown getWordRec Change-Id: I4fef02c227fb858334dbe2eabf2762d5b6e1d919	2010-12-06 18:45:48 +09:00
satok	683192684c	Trim the flow of getWordRec Change-Id: Ic0cfa64ee1e55682ca73681c585db6a5cb510900	2010-12-06 14:56:11 +09:00
satok	28bd03b9f5	Breakdown getWordRec Change-Id: I8556efb1dd053eff9a9681971cbe1014abf0333f	2010-12-03 19:25:42 +09:00
satok	715514d7dd	Breakdown getWordRec and add comments Change-Id: I88bad8a4a8177e3540b995b664c47b86d6904027	2010-12-03 10:01:09 +09:00
satok	18c28f431e	Detach bigram functionarities from unigram_dictionary Change-Id: Ie35164a5f293e5370885a1ba13d6ed7caf6000ec	2010-12-02 18:24:53 +09:00
satok	e808e436cb	Refactor: Move utility functions and no suggestion functions from unigram_dictionary.cpp to dictionary.cpp Change-Id: I6f695e4f5852547d2c00de5ee54a650fef9accbe	2010-12-02 16:11:35 +09:00
satok	3008825948	Fix parameters of native functions and refactor Dictionary - created bigram/unigram dictionary classes Change-Id: I233a28ed8d611870db3f4cf8f25fc45b5d41529b	2010-12-02 01:16:44 +09:00
satok	d4952c8fe9	Move a logic for finding words with a missing character to the native code. Change-Id: I58338643830ff4f9708f78a9c26f75c8bf2ebf45	2010-12-01 19:26:36 +09:00
satok	15dc33d9f6	Add an easy way to output native debug logs Change-Id: Ieff2b8e60c5e7dedb7f86e17f7c37b349a912ab4	2010-12-01 15:56:17 +09:00
Jae Yong Sung	80aa14fd43	- separate dict (uses xml) - retrieve bigrams that only starts with character typed and neighbor keys - contacts bigram - performance measure bug: 2873133 Change-Id: If97c005b18c82f3fafef50009dd2dfd972b0ab8f	2010-07-28 11:08:08 -07:00
Jae Yong Sung	937d5ad013	added bigram prediction - after first character, only suggests bigram data (but doesn't autocomplete) - after second character, words from dictionary gets rearranged by using bigram - compatible with old dictionary - added preference option to disable bigram Change-Id: Ia8f4e8fa55e797e86d858fd499887cd396388411	2010-07-13 11:33:39 -07:00
Ken Wakasa	826269c8ae	Get rid of dependency on native AssetManager API. Confirmed the native code builds with the NDK r3. Change-Id: I0d2d3a0e262847d6948a0336a35440e21e312ad2	2010-04-27 22:23:03 +09:00
Ken Wakasa	f1abb8ce3c	Get rid of code taken from bionic to avoid license issue. Change-Id: If96f4247edbc7b1e9f7418d2ddef191618a54ae3	2010-04-23 01:24:09 +09:00
Ken Wakasa	707505ec18	A part of efforts of unbundling LatinIME: Get rid of ICU dependency in the native code. This is actually a back merge from the LatinIME sandbox. Please refer to http://arvarest.i.corp.google.com:8080/#change,77 Change-Id: I3ff3781903d5c642c662c2d744f808be7e4d8997	2010-04-21 22:43:17 +09:00
Amith Yamasani	07b1603a3f	Don't let the native code target be included twice when unbundling. Move java code to a different directory so that the unbundled version doesn't try to compile the native code again. Change-Id: I05cf9e643824ddc448821f69805ccb0240c5b986	2010-03-09 15:01:09 -08:00

1 2 3

135 commits