LatinIME

Commit Graph

Author	SHA1	Message	Date
satok	10266c09ec	Combine the skipped and transposed correction bug: 4170136 Change-Id: I7b50b40478abf27f51ec5e001815ff4882f3e5e5	2011-08-23 23:40:29 +09:00
satok	208268d149	Add correction state. Change-Id: I0a1419922e1ce7a15b566d1b6da3794f8e84c754	2011-08-10 19:10:26 +09:00
satok	cfca3c6317	Refactor CorrectionState to Correction Change-Id: I5f1ce35413731f930b43b1c82014e65d9eaa240b	2011-08-10 14:40:25 +09:00
satok	8876b75ca1	Move scoring part to the correction state Change-Id: I2dc4a0869636fce5526f48b3a6267b6bdf61dbfb	2011-08-05 17:24:56 +09:00
satok	4e4e74e6b6	Move the input index and output index to correction state Change-Id: Idebdb59143f3367929df6a0475cefe941eb16d01	2011-08-04 14:16:14 +09:00
satok	0f6c8e8aeb	Move code related to ranking algorithm to correction_state.cpp Change-Id: I52b34de45969fef82e46d9c10079c2d45e0b94eb	2011-08-03 20:34:19 +09:00
satok	612c6e49c0	Move code related to ranking algorithm to the correction state Change-Id: I2d9e2db81cf6597ca4e88d7bc6737ab3b52b34b2	2011-08-02 15:44:59 +09:00
satok	db2c0919cf	Remove old dictionary format code Change-Id: Ic4b9e069c9bd5c088769519f44d0a9ea45acb833	2011-08-01 16:01:54 +09:00
satok	2df3060883	Add correction state Change-Id: I0d281cede1590893bd1def005cf83c9431d12750	2011-08-01 15:42:09 +09:00
Jean Chalard	6a0e9642a8	Small native refactoring. Move a purely dictionary-format-related function that is needed both by unigrams and bigrams to the binary format handling file. Also remove the empty UnigramDictionary::getBigrams placeholder function, on grounds that it should be in the BigramDictionary class. Bug: 5046459 Change-Id: I8a67a25f72122e2fa0b19ae1d936db25eb0b20ba	2011-07-26 16:13:53 +09:00
Jean Chalard	999ba61b34	Some native cleanup Take a function that does not need to be a member and make it static inline. Also replace the return value of -1 by a #define'd constant. Change-Id: I92e0deaa1df65998b76aba6329a4c8eb4d287485	2011-07-22 18:09:48 +09:00
satok	d24df43eaf	(Step2)Move functions related to proximity to proximity_info.cpp Change-Id: Iae0eb2a5cd758bda820fa42b4bc3eb3d2665bf96	2011-07-14 15:47:32 +09:00
satok	1d7eaf8462	(Step 1) Move proximity related parameters from unigram_dictionary to proximity_info Change-Id: Ic630b35f4abffeb84c38bcf5935795b7ff07556a	2011-07-14 13:21:34 +09:00
Jean Chalard	1059f27364	New dict format, step 7 This actually implements the new dictionary format, but does not activate the implementation through #defines. Bug: 4392433 Change-Id: I9b26b9bcb4b823a36e0984799b69730acfc6f7f3	2011-07-13 14:33:48 +09:00
Jean Chalard	bb15e77511	Move a function to make next commit more readable Change-Id: Ieaa935ff4d68ce88137dcc5c672a4149a4c9c64f	2011-06-30 20:14:38 +09:00
Jean Chalard	0584f02ee1	Rename parameters for future change Change-Id: Id15a17340fb26f91c72687f30bef24b2d8b94940	2011-06-30 19:23:16 +09:00
Jean Chalard	432789ac93	Internal cleanup Moving functions around, renaming parameters Change-Id: I3ab480f483d7d9700b9328cb07b16b51005098e5	2011-06-30 17:50:48 +09:00
Jean Chalard	ffefdb6c1a	Cleanup. Function renaming, moving around for future patch readability Change-Id: Id33b961cf2e899b5a3c9189951d2199aba801666	2011-06-30 17:22:19 +09:00
Jean Chalard	980d6b6fef	Internal cleanup. Function renaming, useless function supressing, fix comments Change-Id: I148acbaf367cd556a85b89016676b46cc971af81	2011-06-30 17:02:23 +09:00
Jean Chalard	594a9a1963	Internal cleanup. Removed unused function prototypes. Change-Id: Ia56ea8e285deed17ce8377df855b045b7850d58d	2011-06-30 16:51:17 +09:00
Ken Wakasa	ce9e52a12a	Clean up in LatinIME native code Change-Id: I0062200a0181a491690115ac0fab8d11358e2f14	2011-06-18 23:52:09 +09:00
Jean Chalard	ca5ef2890e	New dict format, step 4 Consolidate terminal cases, streamline the word adding process and create the entrances for adding alternate spellings with an empty implementation. Bug: 4392433 Change-Id: I781c93ec49945d71c7c20624c86596aa49add4c8	2011-06-17 20:59:21 +09:00
Jean Chalard	581335c3fb	Fix a bug where bigram search would never return Bug: 4690487 Change-Id: Ie8f3f651508cc48bbb043a0b308f7e0d1524371c	2011-06-17 12:45:17 +09:00
Jean Chalard	17e44a72e8	New dict format, step 3 Some refactoring and add of a parameter that will be necessary. Bug: 4392433 Change-Id: I17f001a7efd4f69f4c35f94ee1ca8e97391b81d5	2011-06-16 23:28:09 +09:00
Jean Chalard	8124e64dcc	New dict format, step 2 Move some methods around and make static some methods Bug: 4392433 Change-Id: I2bbe98aec118a416d21d1e293638e1d324505b9b	2011-06-16 22:33:41 +09:00
Jean Chalard	293ece0f34	New dict format, step 1 This renames some variables and removes dependancies to values that will disappear Bug: 4392433 Change-Id: I79a44462d6bf25248cc2de0d63d7918fc6925d68	2011-06-16 22:18:10 +09:00
satok	d8db9f86d0	Fix a bug on the calculation of the freq on the mistyped space error correction Bug: 4402942 Change-Id: I0b611e3d0e8c25ca528ef7408c3949200e5cad85	2011-05-18 18:36:54 +09:00
satok	3c4bb7747d	A bug fix for the mistyped space algorithm Bug: 3311719 -- also fixed compiler warnings Change-Id: I6941c0d02f10d67af88bc943748dde8d8783fabb	2011-03-04 23:25:48 -08:00
Jean Chalard	eaecb56f94	Merge "Demote skipped characters matched words with respect to length." into honeycomb-mr1	2011-03-04 22:43:16 -08:00
satok	817e517e46	Add the suggestion algorithm of words with space proximity Bug: 3311719 Change-Id: Ide12a4a6280103c092fa0f563dd5b9e3f7f5c89b	2011-03-04 20:37:18 -08:00
Jean Chalard	07a8406bc1	Demote skipped characters matched words with respect to length. Words that matched user input with skipped characters used to be demoted in BinaryDictionary by a constant factor and not at all in those dictionaries implemented in java code. To represent the fact that the impact of a skipped character gets larger as the word is shorter, this change will implement a demotion that gets larger as the typed word is shorter. The demotion rate is (n - 2) / (n - 1) where n is the length of the typed word for n >= 2. It implements it for both BinaryDictionary and java dictionaries. Bug: 3340731 Change-Id: I3a18be80a9708981d56a950dc25fe08f018b5b89	2011-03-05 13:20:19 +09:00
Jean Chalard	a787dba83b	Fix a bug with umlaut processing. Issue: 3275926 Change-Id: Ibcb00aaea3ff05ad59ad4e8e54dd3caab5ab9bca	2011-03-04 13:07:07 +09:00
Jean Chalard	c2bbc6a449	Use translation of fallback umlauts digraphs for German. For German : handle "ae", "oe" and "ue" to be alternate forms for umlaut-bearing versions of "a", "o" and "u". Issue: 3275926 Change-Id: I056c707cdacc464ceab63be56c016c7f8439196c	2011-03-03 11:52:23 +09:00
satok	8fbd552292	Add proximity info to native Bug: 3311719 Change-Id: Ie596304070e321ad23fb67a13bf05e2b6af1b54b	2011-02-23 23:04:00 +09:00
Jean Chalard	f5f834afcd	Rename variables with obscure names. The `snr' variable has a very obscure name. Rename it to `matchWeight'. Also, the `toLowerCase' function is error-prone, since it actually returns a lower case version of the BASE char, that is without diacritics. Hence, rename it to `toBaseLowerCase' and update variables with similar names. Change-Id: Ibdbe73018a33ee864db59a51d664c3b104d5fb3f	2011-02-22 16:43:19 +09:00
Tadashi G. Takaoka	887f11ee43	Remove next letters frequency handling Bug: 3428942 Change-Id: Id62f467ce4e50c60a56d59bf96770e799a4659e2	2011-02-17 13:59:41 +09:00
Jean Chalard	8dc754a411	Promote full matches with differing accents. Stop considering accented characters as different from their base character for proximity scoring. Also give a huge boost (basically overriding frequency) to a word fully matched with only differing accents. Bug: 2550587 Change-Id: I2da7a71229fb3868d9e4a53703ccf8caeb6fcf10	2011-01-27 17:29:24 +09:00
satok	58c49b9132	Fix auto-correction threshold and promote full matched words Bug: 3374359 Bug: 3278422 "zbe" will be auto corrected to "be" by fixing s-line "teh" will be auto corrected to "the" by promotion of full matched words Change-Id: I314c632820e4e0b1501edeca60ada205d291451f	2011-01-27 12:53:13 +09:00
Ken Wakasa	e90b333017	Load main dic in native Follow up to Id57dce51 bug: 3219819 Change-Id: I00e11ef21d0252ffa88c12dffb9c55b0f2e19a66	2011-01-07 19:51:45 +09:00
satok	54fe9e0e20	Suggest words with excessive chars out of proximity chars Bug: 3273807 Change-Id: Ib8f48e562bcf4c2aac0ad5cb46809fd5f539a322	2010-12-13 17:44:14 +09:00
satok	a3d78f606e	Suggest words with transposed chars Bug: 3193883 Change-Id: I884b669258bfc522bc04e14f22a7646164a4cac5	2010-12-10 18:34:23 +09:00
satok	e07baa6fab	Limit the suggestions with an excessive character by filtering proximity characters Change-Id: Iad26dad545f1a431aa0fa53f99198b27defd03a3 ug: 3269482	2010-12-10 00:47:37 +09:00
satok	aee09dc5fa	Fix a bug that We can't suggest words with missing space if one of the words starts with a capitalized character. Bug: 3268825 Change-Id: I0634a243ad1e45dd096b30824b463c366a2e7f0f	2010-12-09 21:41:26 +09:00
satok	662fe69ba2	Suggest words with missing space Bug: 3193883 Change-Id: I8d25f3e1d4db10be733d85edfa4f55a094feef80	2010-12-09 14:26:27 +09:00
satok	cdbbea735f	Suggest excessive characters bug: 3193883 Change-Id: Iea7a0fce7ce62d8779a7c7e4613d50db30d82b07	2010-12-08 16:56:06 +09:00
satok	d299792368	Make no-recursive getWordRec Change-Id: Id90f3ca86ef490834cefa92f0d6958b1289fc633	2010-12-07 16:45:32 +09:00
satok	48e432ceb8	Breakdown getWordRec Change-Id: I4fef02c227fb858334dbe2eabf2762d5b6e1d919	2010-12-06 18:45:48 +09:00
satok	683192684c	Trim the flow of getWordRec Change-Id: Ic0cfa64ee1e55682ca73681c585db6a5cb510900	2010-12-06 14:56:11 +09:00
satok	28bd03b9f5	Breakdown getWordRec Change-Id: I8556efb1dd053eff9a9681971cbe1014abf0333f	2010-12-03 19:25:42 +09:00
satok	715514d7dd	Breakdown getWordRec and add comments Change-Id: I88bad8a4a8177e3540b995b664c47b86d6904027	2010-12-03 10:01:09 +09:00

1 2

53 Commits (55072fefe601a08511df1b6387d918ca5831d2c7)