LatinIME

Author	SHA1	Message	Date
Yuichiro Hanada	c922c8a504	Add DictEncoder. Change-Id: I41049b9118b58838e5dedf8e5618d939ca70c5ef	2013-08-22 11:53:41 +09:00
Yuichiro Hanada	558e34c7bd	Make readPtNode be called with the address from the beginning of the file. Change-Id: I8939fdfb4f79e55bcd7393633784effb30df3f8f	2013-08-21 20:02:18 +09:00
Yuichiro Hanada	a306e08753	Rename BinaryDictEncoder to BinaryDictEncoderUtils. Change-Id: I4dabf17da7003b1d8204a83dbd10e5be6e8fd805	2013-08-21 18:54:34 +09:00
Yuichiro Hanada	107a5f6fb8	Add PtNodeReader. Change-Id: Ic918822fc1b3a8a7c39ffbcf7defde2c5bf888db	2013-08-21 18:43:18 +09:00
Yuichiro Hanada	065aad9501	Add DictDecoder. Change-Id: Ia1c32f21fe07081ce04d093660e18146b93275a4	2013-08-20 17:43:13 +09:00
Yuichiro Hanada	112257e40f	Rename BinaryDictDecoder to Ver3DictDecoder. Change-Id: Ibf9b95b658df6e2c2218bdb62e2380f326a03832	2013-08-20 17:11:51 +09:00
Yuichiro Hanada	66004ce2de	Remove populateOptions. Change-Id: I1a1830aaa8ea586b68fc34ff3a27ae52b810e8af	2013-08-20 16:06:52 +09:00
Yuichiro Hanada	77bce05e6f	[Refactor] Rename BinaryDictReader and BinaryDictDecoder. BinaryDictReader -> BinaryDictDecoder. BinaryDictDecoder -> BinaryDictDecoderUtils. Change-Id: Iadf2153b379b760538ecda488dda4f17225e5f37	2013-08-19 19:36:31 +09:00
Yuichiro Hanada	d794b42f98	Add HeaderReaderInterface. Change-Id: I298f86b70d18cd08b240509b6f757c72e1a59ffe	2013-08-19 11:15:03 +09:00
Yuichiro Hanada	3a73b37b30	Make BinaryDictIOUtils and DynamicBinaryIOUtils use BinaryDictReader. Change-Id: I191dfe0e05ff3c2c5af99e8beebbb73b097748a3	2013-08-16 21:06:23 +09:00
Yuichiro Hanada	e72c4e5fc7	Remove a static buffer for thread safety. Change-Id: I335c35eb182ff63abb8a5b04c053a98d44b7c6ce	2013-08-16 20:22:46 +09:00
Ken Wakasa	47bac6ebf2	Merge "Remove unnecessary caching."	2013-08-16 08:28:19 +00:00
Yuichiro Hanada	6e26cc3f5d	Remove unnecessary caching. Change-Id: Ic4ccab9d344b30b72fca1503827eec1c628fa4ac	2013-08-16 17:10:45 +09:00
Jean Chalard	af30cbf0ee	Rename Node to PtNodeArray Bug: 10247660 Change-Id: I1a0ac19f58f96adb5efac5fd35c6404831618c99	2013-08-16 16:24:54 +09:00
Yuichiro Hanada	94460eba11	[Refactor] Divide BinaryDictInputOutput into BinaryDictEncoder and BinaryDictDecoder. Change-Id: I7c3269d77e3e3b567e459dcaa1bc029903941744	2013-08-15 20:23:07 +09:00
Jean Chalard	e7870a2c0d	Add an initial JNI interface to dicttool. Bug: 10100269 Change-Id: I883992c2033e7d9e7c754c0bf653767728b221b6	2013-08-15 17:58:55 +09:00
Ken Wakasa	117f18e844	Revert "[Refactor] Divide BinaryDictInputOutput into BinaryDictInputUtils and BinaryDictOutputUtils." This reverts commit `4c63d0614e`. Change-Id: I1fa277d720bab4d895259df7d6d82eebfa5eb6c5	2013-08-15 08:54:29 +00:00
Yuichiro Hanada	4c63d0614e	[Refactor] Divide BinaryDictInputOutput into BinaryDictInputUtils and BinaryDictOutputUtils. Change-Id: I0d476abe763c11ba9005152f928e8dccf15ac9de	2013-08-15 15:46:58 +09:00
Yuichiro Hanada	1db93c9c04	[Refactor] Move some helper methods to BinaryDictIOUtils. Change-Id: Ib817a975dc1f82241f732b236c44b042fda25b3c	2013-08-15 10:49:40 +09:00
Yuichiro Hanada	3edb62c69b	Move some methods in BinaryDictIOUtils to DynamicBinaryDictIOUtils. Change-Id: I9ba55582c533fef0eb3e60c46bf23c8b16ee1ff4	2013-08-14 19:33:36 +09:00
Ken Wakasa	f795f2b789	Merge "Add FusionDictionaryBufferFromWritableByteBufferFactory."	2013-08-14 10:26:21 +00:00
Yuichiro Hanada	665592774c	Move some constants in BinaryDictInputOutput to FormatSpec. Change-Id: I6b12faf35b65238b9a64c82d4d1a6050f980e72e	2013-08-14 19:19:27 +09:00
Yuichiro Hanada	bbc8a930f7	Add FusionDictionaryBufferFromWritableByteBufferFactory. Change-Id: I23de0a178e7f11f2cf301fd433cde60c6152055b	2013-08-14 17:07:44 +09:00
Yuichiro Hanada	3feacba1eb	Add BinaryDictReader. Bug: 9618601 Change-Id: Ief07fa0c3c4f7f5999a3fafcef4e47b6b6fd8143	2013-08-13 19:55:05 +09:00
Yuichiro Hanada	b7bb9c9722	Make readHeader check the header size before using it. Change-Id: I5dc3e2b674f7343ef57317fde6bdb7349a7fe04c	2013-08-13 17:06:25 +09:00
Yuichiro Hanada	7ec9db2c34	Remove the code and comments about version 1 format. Change-Id: I827052f234eeaa4dbcfd37da69a99866896a158b	2013-08-09 16:05:07 +09:00
Yuichiro Hanada	7d1ae52ded	Fix unit tests. Change-Id: Ic0013089625e112aaccc888d462330640ef7cc6f	2013-08-08 19:12:35 +09:00
Jean Chalard	93445b4821	Fix some warnings Change-Id: I7290cd1fb675a1b85b9b6ac2d464c932b5bca1dd	2013-07-31 16:17:01 +09:00
Satoshi Kataoka	ffcbbaf127	Refactor on UserHistoryDictionary Bug: 9429906 Change-Id: I576a91643bdaf5017cc826ac2e07a74a9a275d60	2013-07-26 13:00:19 +09:00
Jean Chalard	25de86a6a2	[FD4] Separate cached address before/after update for groups This should fix bug#8526576 for good. Bug: 8526576 Change-Id: I473aad26b69d64efa09d2ec9d8e69f29f5cf4819	2013-07-24 18:40:14 +09:00
Jean Chalard	429db8d61e	[FD3] Split stackNodes into two methods. In the future we need to have a method that computes only from the size, as we used to have, to initialize the cached addresses, and a much simpler and faster method to copy the cached sizes. Bug: 8526576 Change-Id: I6a5a790303ab8f3bf957c7ca266eb12da7c1ad9e	2013-07-24 17:26:16 +09:00
Jean Chalard	91cbe3566d	[FD2] Separate cached address before/after update for nodes. Bug: 8526576 Change-Id: Ib9f8594a9e12dc75eba296faff2612c4bd7483d3	2013-07-23 17:52:54 +09:00
Jean Chalard	257750d988	[FD1] Move parents' address computation outside There is no need to do it repeatedly in this loop: it's clearer and faster to do it at the end only. Bug: 8526576 Change-Id: I707571179c89479830891ec6d4fd06a9fffed7c1	2013-07-17 20:47:53 +09:00
Jean Chalard	c2e9c511cb	Fix Binary dict tests There are two problems here. The first one is the tests would send an invalid unicode character. Although we could want dicttool to handle this more gracefully, it's fine for now. The second problem is much more serious. If a node has more than 128 children, then the java code will crash trying to read the dictionary back because of a bug that this change fixes. In theory, it's possible that happens when we try to load the user history dictionary back from the disk - native code is not affected so there is no other point that may cause a problem. In practice, that means you'd need to have 129 words with a common prefix (including empty string) but all different after this. It's almost impossible with Google Keyboard since there are only so many keys on the keyboard that you can make a word out of, and then again you'd have to do it repeatedly until it actually enters the user history dictionary, and wait for it to get saved on the disk. The bad news is, if you manage to get this far, the keyboard will crash every time and won't be able to start again until you clear data for the package. The good news is, the dictionary itself is not corrupted and only the reading code is wrong. So updating to a newer version would actually even recover from this situation. All in all, considering how almost-impossible this is to trigger, I don't think even a single user actually did hit this bug. Bug: 8583091 Change-Id: Iabb2a7f47cbd9ed3193d2a3487318d280753e071	2013-04-15 12:48:16 +09:00
Jean Chalard	ca0fdbbe2e	Fix two bugs in dicttool Both bugs only affect debug mode. One has the wrong object tested with equals, the other has the iteration failing in some cases. Change-Id: Ie9100d257a3f9e3be340cf3e38116f63417bdc1a	2013-04-10 22:10:31 +09:00
Jean Chalard	a411595b16	Fix two nasty bugs with surrogate pairs. The important bug is in findWordInTree. The problem, which is not obvious, is that we were calling codePointAt() with the code point index in the string, instead of the char index. The other bug this change fixes was harmless in practice, because it's in the iteration which is only used for debug and pretty printing purposes. It's very similar in that it would subtract a length in code points from a length in chars and truncate a StringBuilder at that length, so it would fail in a quite similar manner. This changes the meaning of the "length" attribute in Position, but it's clearer this way anyway. Bug: 8450145 Change-Id: If396f883a9e6449de39351553ba83f5be5bd30f0	2013-04-01 17:06:19 +09:00
Jean Chalard	c6799ffeab	Send the dictionaries descriptions to the dict pack Bug: 8255795 Change-Id: I12a5922f50c2d2e3aa639457abcc1483e6a48721	2013-02-23 01:46:39 -08:00
Jean Chalard	2521edec09	Fix a bug with the passed dictionary id We used to make the dictionary that we passed to the dictionary pack as an initial value based on the locale. This is wrong - it should be read from the dictionary. This change fixes that. Bug: 7005813 Change-Id: Ib08ed31dd9c216f6f7b9c6c3174ca514bf96e06f	2013-02-22 20:49:48 -08:00
Jean Chalard	af4a7e8c4b	Create methods in LatinIME to make the current dict lists Bug: 7005813 Change-Id: I82232af8e3071333b6fd01e4453b6b3c0a3ddb1f	2013-01-31 09:26:52 +09:00
Tadashi G. Takaoka	8aa9963a89	Fix Apache license comment Change-Id: Ic56167f952a7f4449da366e1e81610e72c966086	2013-01-21 22:23:37 +09:00
Jean Chalard	fbc5e9b334	[AD3] Implement the interface to choose a local dictionary Bug: 7702011 Change-Id: Id3b9c58dbbf5097e4d6ce986d20924eae19f9690	2013-01-21 15:40:46 +09:00
Jean Chalard	1d15fe7e51	[AD2] Add a helper method to read an arbitrary dict header Bug: 7702011 Change-Id: Ib88f6dc222892831ae6932635b65fd2595b16b43	2013-01-18 20:34:28 +09:00
Ken Wakasa	b6ca354431	Small code cleanups Multi-project commit with I249d5fbe Change-Id: Ia28c4e970992aa1299a30e604eaa5d096655c3a5	2013-01-07 12:13:42 +09:00
Ken Wakasa	45239029ce	Remove trailing spaces Change-Id: I260b85ef9e91d17f97d6e405d2d92a65b443df44	2012-12-19 15:36:55 +09:00
Jean Chalard	f1b464da31	Remove a useless member Change-Id: Id13e0aeec6ec3655d6bb0edc7f8f7821e7dc5a36	2012-12-11 19:15:24 +09:00
Jean Chalard	2da8866518	Remove a couple Eclipse and Android Lint warnings Change-Id: I0c29c5d2abcbf80759b996d34b534deb083cd7d3	2012-12-06 21:30:51 +09:00
Jean Chalard	51a0ef8c59	Add a plumbing option to dicttool info. Also align the `porcelain' option to the diff command that was used mistakenly. Bug: 7388665 Change-Id: Ic0e1b98c62ce37b2e909384a0370af4458563703	2012-10-31 16:35:22 +09:00
Jean Chalard	f41389a74b	Remove warnings Thanks Eclipse Change-Id: I88e3979ed22be5d8be5a5accdde417c6b1a8bf2d	2012-10-29 14:24:16 +09:00
Jean Chalard	a23e333079	Implement the word-level diff (A9) Bug: 7388857 Change-Id: I4c4560d4f4b579936a44cdf409a4c27300b65610	2012-10-29 12:31:22 +09:00
Jean Chalard	47cac57e45	Finish up the "info" command in dicttool. (A6) Bug: 7388857 Change-Id: I704f12a6be76ce1644ec5e8dd3b667f112e9c04a	2012-10-25 19:15:24 +09:00
Jean Chalard	b3c98901c5	Add auto detection and decoding of dictionary files. (A2) Bug: 7388852 Change-Id: I25e755fc15f5b383acc046f668e9681efa4f0c2f	2012-10-25 16:40:15 +09:00
Jean Chalard	ddb0bcc051	Fix a bug where a bigram would be ignored Bug: 7403386 Change-Id: I89f495d07f7059a9f1ccd97d487c2f2657a8ebd2	2012-10-24 13:24:59 +09:00
Jean Chalard	c59c741987	Return the correct bigram frequency The "correct" bigram frequency is now returned by the reading code. However, as the binary format represents the frequency in a lossy manner, the frequency is not guaranteed to be the exact same as the one in the source text format - only a close enough value. It is however the exact same value seen by the native code. Bug: 7395653 Change-Id: I49199ef18901c671189912b3550623e9643baedd	2012-10-23 17:17:37 +09:00
Tadashi G. Takaoka	15f6d4ae34	Add @UsedForTesting and @ExternallyReferenced annotations Bug: 7268357 Change-Id: I0b7e0c19f04af9ae30874d0a4c26ad81bc80be8c	2012-10-22 11:18:43 -07:00
Yuichiro Hanada	d2579c4832	fix writeCharGroup. Change-Id: Ib841afaba0a20c3b300eb7d3e9133243f9f3ae58	2012-10-05 14:54:17 +09:00
Yuichiro Hanada	3c6d9fe148	Add insertWord. bug: 6669677 Change-Id: Ide55a4931071de9cd42c1cddae63ddd531d2feba	2012-10-04 17:19:47 +09:00
Yuichiro Hanada	c3a98ca306	Add writeNode. Change-Id: I088bb6ea43ce0841d725e48b677d429e1155569d	2012-10-04 14:28:42 +09:00
Yuichiro Hanada	38712ff27d	Add updateParentAddresses. Change-Id: Iac210131b7c003ef363e1138bf22f777a37c6a89	2012-10-03 19:37:17 +09:00
Yuichiro Hanada	a853356b82	Add isDeletedGroup. Change-Id: I83f09c068868e5e6e1b46f494a6ef957f0b466d8	2012-10-03 02:19:41 -07:00
Yuichiro Hanada	7223cc2ef1	Add MAX_BIGRAMS_IN_A_GROUP. Change-Id: I128d5deb8e523045d7ad77d7a8fd3db944f71238	2012-10-03 18:10:06 +09:00
Yuichiro Hanada	4ad4ff618f	Add makeCharGroupFlags. Change-Id: Id2c580f21b77f66a97c5fbdf4542fdafe6c43614	2012-10-03 14:33:59 +09:00
Yuichiro Hanada	7f438aa12f	Make writeCharGroup return a size of a new group. bug: 6669677 Change-Id: I56f6a07b04b08443f2c052927404318c2018fc9d	2012-10-01 22:02:04 +09:00
Yuichiro Hanada	fb7e08ea8f	Add writeCharGroup. bug: 6669677 Change-Id: I36792ba9c511a5148c963096cc93ca8c2e0ee04e	2012-10-01 21:50:38 +09:00
Yuichiro Hanada	f3aed3ea26	Add updateChildrenAddress. Change-Id: Ic06a755d85612476e719e580469dc1cd9447286c	2012-09-28 18:45:56 +09:00
Tadashi G. Takaoka	a28a05e971	Cleanup: Make some classes as final Change-Id: I6009b3c1950ba32b7f1e205a3db2307fe0cd688e	2012-09-27 19:03:30 +09:00
Yuichiro Hanada	84d858ed5e	Use BinaryDictInputOutput to save UserHistoryDictionary. bug: 6669677 Change-Id: I08193c26f76dbd48168f8ac02c1b737525bfc7b2	2012-09-27 12:02:17 +09:00
Yuichiro Hanada	2aea34fb31	Add updateParentAddress. bug: 6669677 Change-Id: I353f8ae53720cdf7a809271a28cb703709609f53	2012-09-26 17:18:01 +09:00
Yuichiro Hanada	2ee70804e9	Add moved char groups. bug: 6669677 Change-Id: I372f841044fe8e076a50a80ac10b715e5f8fd4eb	2012-09-26 17:01:48 +09:00
Yuichiro Hanada	a161bdac88	add capacity to FusionDictionaryBufferInterface. bug: 6669677 Change-Id: I4627093811a19c46ce13fe351d1db63cbd78cf4a	2012-09-25 21:47:11 +09:00
Yuichiro Hanada	93d7c6233f	Make getTerminalPosition read linked-list nodes. bug: 6669677 Change-Id: I599d276f430efe23d402695c325e23906b7705b3	2012-09-25 21:11:15 +09:00
Yuichiro Hanada	8ec0064c49	Make children addresses and parent addresses use signed addresses. Signed addresses are used only in version 3 with dynamic update. bug: 6669677 Change-Id: Iadaeab199b5019d2330b4573c24da74d64f0945e	2012-09-25 12:55:14 +09:00
Yuichiro Hanada	82d9deaaf2	Combine mHasParentAddress with mHasLinkedListNode into mSupportsDynamicUpdate. bug: 6669677 Change-Id: I82799af199358420f09ac34fc005091e202c5d3b	2012-09-24 13:17:44 +09:00
Yuichiro Hanada	66597f5e5f	Add deleteWord. bug: 6669677 Change-Id: I1a5b90ee05e5cffd74a5c140384a3e37c79e7e70	2012-09-21 12:40:07 +09:00
Yuichiro Hanada	73779f7631	Make readUnigramsAndBigramsBinary read linked-list nodes. Change-Id: I07ae036b0b06e71d7a18f2bf11e4692cd4213568	2012-09-20 20:37:02 +09:00
Yuichiro Hanada	d36245fad2	Add getTerminalPosition. Change-Id: If04d779db23b1aea2cc12e5e9b8cecfcb35a5737	2012-09-20 18:02:16 +09:00
Yuichiro Hanada	65feee12e5	Make BinaryDictIOUtils. Change-Id: I45830235ee738233e8eb2bd91d659705b698f58c	2012-09-19 15:37:37 +09:00
Yuichiro Hanada	c2fdf0dfbf	Make readNode read linked list nodes. Change-Id: Ia5eaae0653179b2eb74c53b0823beaf80377a389	2012-09-19 14:49:23 +09:00
Yuichiro Hanada	a149c53c8e	add limit to FusionDictionaryBufferInterface. Change-Id: Ic9ff717a9751023d47b02ff3b9d1fbf3115c2501	2012-09-19 12:28:19 +09:00
Yuichiro Hanada	b686df15fc	Add a new flag for linked list nodes. Change-Id: Ib2f194775cfe5ab05481ac95cd709d6e8e8dd3c6	2012-09-18 22:01:49 +09:00
Yuichiro Hanada	bf45dc4860	Make writePlacedNode write the linked-list node. Change-Id: I60feda815ea08cf73300fccca1ae12b97550f116	2012-09-18 21:20:07 +09:00
Yuichiro Hanada	061d225fb1	Add a new option to FormatOptions. Change-Id: I8bf089bea5de46570a5e81fb1ea3ab22c07eeee1	2012-09-18 21:03:13 +09:00
Jean Chalard	ed47131612	Merge "Fix a bug with surrogate characters" into jb-mr1-dev	2012-09-18 02:06:55 -07:00
Jean Chalard	6c721b5f68	Fix a bug with surrogate characters This is a pretty bad bug :/ Bug: 7013840 Change-Id: I12c7cfa4fa9d56b2c1fee6e6222c64fe20b88fa3	2012-09-18 18:01:15 +09:00
Yuichiro Hanada	8adc0154e6	Remove populateOptions(final ByteBuffer buffer). Change-Id: Ifc4c64c9cffe4f343c5a604c192db010a1792acc	2012-09-18 14:42:52 +09:00
Yuichiro Hanada	cc958dd96e	Refactor BinaryDictInputOutput. Change-Id: Idb4b635fcac70cc988e0dd3ce3bf121fba12099c	2012-09-14 11:08:01 +09:00
Yuichiro Hanada	1a347723c5	Move FormatOptions and FileHeader to FormatSpec. Change-Id: I232e35598635113bf2c81825669c744aadc79efe	2012-09-13 16:35:41 +09:00
Yuichiro Hanada	81d97eec0e	Move constants and comments. Change-Id: Ifd66bda7d528827ba61c60531121ea206a2325be	2012-09-13 14:28:39 +09:00
Yuichiro Hanada	8d031a63b4	Add put method to FusionDictionaryBufferInterface. Change-Id: Iac0b35d2da05e81237d105e8fe13c56d16038de1	2012-09-12 15:41:21 +09:00
Yuichiro Hanada	e55b644aef	Add new binary dictionary format. Change-Id: Ia99411d4009857d5e420ca87ef8acf1f1826d3ed	2012-09-10 13:05:46 +09:00
Yuichiro Hanada	eae7b293e4	Check the length of the word when add to FusionDictionary. Change-Id: Id98d18e90a8b83b597507728b467f56888c8fd12	2012-09-10 12:35:53 +09:00
Yuichiro Hanada	83dfe0fd8c	Add FormatOptions. Change-Id: Ibad05a5f9143de1156b2c897593ec89b0a0b07e7	2012-09-05 18:05:43 +09:00
Ken Wakasa	f2789819bd	Cosmetic fixes and a bug fix in UnigramDictionary::testCharGroupForContinuedLikeness(). This change has actually been extracted from a change work in progress I4fe423834b8131fb122251892c98228a6e08ba25 Change-Id: I52568fa09da2ea22be7f8bfe9676b7cd73c31fa4	2012-09-04 14:23:37 +09:00
Jean Chalard	2035b946a3	Merge "Reinstate the shortcut-only attribute" into jb-mr1-dev	2012-09-02 19:28:01 -07:00
Jean Chalard	72b1c93941	Reinstate the shortcut-only attribute Also add the blacklist attribute Bug: 7005742 Bug: 2704000 Change-Id: Icbe60bdf25bfb098d9e3f20870be30d6aef07c9d	2012-08-31 22:11:52 +09:00
Yuichiro Hanada	666a433802	add UserHistoryDictIOUtils. Change-Id: I8a70e43b23f65b5fd5f0ee0b30a94ad8f5ef8a8a	2012-08-31 15:08:57 +09:00
Yuichiro Hanada	b2a43a2ed4	add readUnigramsAndBigramsBinary. Change-Id: I7967f11211221d4877bf0a0c30183af885f45390	2012-08-31 14:39:19 +09:00
Yuichiro Hanada	62ed901100	add readHeader. Change-Id: I5be5d62a63ca897e36fe93200ffdca6befb363aa	2012-08-30 14:17:50 +09:00
Yuichiro Hanada	f5c4ff4817	Add FusionDictionaryBufferInterface. Change-Id: I8640c994231d5f46bc6e074ce8a5bf5344fed0aa	2012-08-29 19:27:49 +09:00
Yuichiro Hanada	d4fe7fda30	Use ByteBuffer when reading FusionDictionary from file. Change-Id: Ia71561648e17f846d277c22309ac37c21c67a537	2012-08-24 13:31:08 +09:00
Jean Chalard	13822d2b05	Hack to skip reading an outdated binary file. Bug: 7005813 Change-Id: Ie0d8d4b2d5eb147838ca23bdd5ec1cecd4f01151	2012-08-20 13:56:52 +09:00
Ken Wakasa	72c0f4de1d	Merge "add reconstructBigramFrequency" into jb-mr1-dev	2012-08-17 03:19:12 -07:00
Yuichiro Hanada	c0a75c8ecb	add reconstructBigramFrequency Change-Id: Iff20dcb9ca0d6064bb118247887fe24b812c0c61	2012-08-17 19:05:16 +09:00
Jean Chalard	aa27635a8a	Reword a confusing comment Bug: 7005645 Change-Id: Ifd942b3ce242aeeec512e132e1cee31329e994b1	2012-08-17 17:22:28 +09:00
Yuichiro Hanada	0d35c159fe	fix findWordInTree. Change-Id: I8f42df28f76188677db9d4e55885e1fc6a40b53f	2012-08-17 10:23:01 +09:00
Yuichiro Hanada	66f338983b	fix findWordInTree. Change-Id: I9d81c815494a0670afa81219ad7bad82274d997e	2012-08-16 20:21:47 +09:00
Jean Chalard	54e84a00fc	Make a makedict command for dicttool (A3) This behaves exactly as the old makedict command. Further changes will redirect the calls to makedict to this, so as to consolidate similar code. Groundwork for Bug: 6429606 Change-Id: Ibeadbf48bec70f988a15ca36ebf5d1ce3b5b54ea	2012-08-04 01:11:46 +09:00
Jean Chalard	d10c473347	Small performance tweak Change-Id: Icd540742073d49d12e70b2d8bd99aaf7ccb5802d	2012-06-08 17:09:40 +09:00
Jean Chalard	7214617622	Remove a slew of Eclipse warnings. Change-Id: I03236386aea13fbd4fb8eaeee18e0008aa136502	2012-06-08 16:23:18 +09:00
Tadashi G. Takaoka	93ebf74bae	Clean up some compiler warnings Change-Id: I604da15e65fc3cf807ec4033df4e4cd5ef0196fc	2012-05-25 19:04:54 +09:00
Jean Chalard	418b343797	Use a formula packing more information into 4 bits field Bug: 6313806 Change-Id: Id0779bd69afae0bb4a4a285340c1eb306544663a	2012-05-15 18:59:21 +09:00
Jean Chalard	76319c6931	Small optimization Performance gain is < 2% Bug: 6394357 Change-Id: I2b7da946788cf11d1a491efd20fb2bd2333c23d1	2012-05-14 15:52:01 +09:00
Jean Chalard	4df5b43df8	Small optimizations Bug: 6394357 Change-Id: I00ba1b5ab3d527b3768e28090c758ddd1629f281	2012-05-14 15:51:58 +09:00
Jean Chalard	3b1b72ac4d	More optimizations We don't merge tails anyway, and we can't do it any more because that would break the bigram lookup algorithm. The speedup is about 20%, and possibly double this if there are no bigrams. Bug: 6394357 Change-Id: I9eec11dda9000451706d280f120404a2acbea304	2012-05-14 12:41:18 +09:00
Jean Chalard	12efad3d15	Some more obvious optimizations The speedup is about 15% Bug: 6394357 Change-Id: Ibd57363d9d793206dd916d8927366db4192083b6	2012-05-14 12:35:31 +09:00
Jean Chalard	47db0be7cb	Some obvious optimizations to makedict Bug: 6394357 Change-Id: Ibfd98aac2304ef50cf90b1de984736ddcfe7a4bc	2012-05-14 12:34:05 +09:00
Jean Chalard	f7346de94a	Write the bigram frequency following the new formula This also tests for bigram frequency against unigram frequency Bug: 6313806 Bug: 6028348 Change-Id: If7faa3559fee9f2496890f0bc0e081279e100854	2012-05-11 20:27:22 +09:00
Jean Chalard	4455fe2c89	Refactor a method Rename it, rename parameters, and add a parameter that will be necessary soon. Also, rescale the bigram frequency as necessary. Bug: 6313806 Change-Id: I192543cfb6ab6bccda4a1a53c8e67fbf50a257b0	2012-05-11 19:34:35 +09:00
Ken Wakasa	84478103ec	Tidy up the MakedictLog class. Follow up to I436b2b7b Change-Id: Id17b134dab2f876b874a505e92a379c8b5567fa4	2012-05-05 23:40:21 +09:00
Ken Wakasa	03b423f313	Suppress debug log from makedict in LatinIME bug: 6447900 Change-Id: I436b2b7b261b422a7edca9cb99a4689b63877fe0	2012-05-05 09:28:27 +09:00
Jean Chalard	20a6dea1ca	Add a flag for bigram presence in the header This is a cherry-pick of Icb602762 onto jb-dev. Bug: 6355745 Change-Id: Icb602762bb0d81472f024fa491571062ec1fc4e9	2012-04-26 16:40:29 +09:00
Jean Chalard	44c64f46a1	Ignore bigrams that are not also listed as unigrams This is a cherry pick of I14b67e51 on jb-dev Bug: 6340915 Change-Id: Iaa512abe1b19ca640ea201f9761fd7f1416270ed	2012-04-26 15:20:30 +09:00
Jean Chalard	805fed49e1	Merge "Fix binary reading code performance."	2012-04-23 23:39:37 -07:00
Jean Chalard	1d80a7f395	Fix binary reading code performance. This is not the Right fix ; the Right fix would be to read the file in a buffered way. However this delivers tolerable performance for a minimal amount of code changes. We may want to skip submitting this patch, but keep it around in case we need to use the functionality until we have a good patch. Change-Id: I1ba938f82acfd9436c3701d1078ff981afdbea60	2012-04-24 15:16:17 +09:00
Jean Chalard	a64a1a46e4	Fix a bug where a node size would be seen as increasing. The core reason for this is quite subtle. When a word is a bigram of itself, the corresponding chargroup will have a bigram referring to itself. When computing bigram offsets, we use cached addresses of chargroups, but we compute the size of the node as we go. Hence, a discrepancy may happen between the base offset as seen by the bigram (which uses the recomputed value) and the target offset (which uses the cached value). When this happens, the cached node address is too large. The relative offset is negative, which is expected, since it points to this very charnode whose start is a few bytes earlier. But since the cached address is too large, the offset is computed as smaller than it should be. On the next pass, the cache has been refreshed with the newly computed size and the seen offset is now correct (or at least, much closer to correct). The correct value is larger than the previously computed offset, which was too small. If it happens that it crosses the -255 or -65535 boundary, the address will be seen as needing 1 more byte than previously computed. If this is the only change in size of this node, the node will be seen as having a larger size than previously, which is unexpected. Debug code was catching this and crashing the program. So this case is very rare, but in an even rarer occurrence, it may happen that in the same node, another chargroup happens to decrease its size by the same amount. In this case, the node may be seen as having not been modified. This is probably extremely rare. If on top of this, it happens that no other node has been modified, then the file may be seen as complete, and the discrepancy left as is in the file, leading to a broken file. The probability that this happens is abysmally low, but the bug exists, and the current debug code would not have caught this. 
To further catch similar bugs, this change also modifies the test that decides if the node has changed. On grounds that all components of a node may only decrease in size with each successive pass, it's theoretically safe to assume that the same size means the node contents have not changed, but in case of a bug like the bug above where a component wrongly grows while another shrinks and both cancel each other out, the new code will catch this. Also, this change adds a check against the number of passes, to avoid infinite loops in case of a bug in the computation code. This change fixes this bug by updating the cached address of each chargroup as we go. This eliminates the discrepancy and fixes the bug. Bug: 6383103 Change-Id: Ia3f450e22c87c4c193cea8ddb157aebd5f224f01	2012-04-24 14:04:02 +09:00
Tom Ouyang	df7ebbbd61	Change binary dictionary output buffer size to match dictionary size. Bug: 6355943 Change-Id: Iaab7bc16ba0dbc7bfde70b06e7bd355519838831	2012-04-19 10:18:57 -07:00
Jean Chalard	f420df2823	Add support for German umlaut and French ligatures flags Bug: 6202812 Change-Id: Ib4a7f96f6ef86c840069b15d04393f84d428c176	2012-04-06 17:07:29 +09:00
Jean Chalard	b8060399c7	Remove constructors And small cleanup. Change-Id: I1de903f42c1b8d57a488be2162e0b94055a6d1f2	2012-04-06 16:53:15 +09:00
Jean Chalard	8cf1a8d04f	Remove the shortcutOnly attribute which is now useless. Change-Id: Ifccdfdaf7c0066bb7728981503baceff0fedb71f	2012-04-06 16:27:53 +09:00
Jean Chalard	c734c2aca1	Add a simple way to input dictionary header attributes Just add them as an attribute to the root of the XML node. Bug: 6202812 Change-Id: Idf040bfebf20a72f9e4370930a85d97df593f484	2012-04-03 15:18:51 +09:00
Jean Chalard	e705a122d1	Remove useless adding of shortcut as unigrams. Change-Id: I1f50ebf00d6dd0dad4114fad86ace5b7b304613a	2012-03-28 20:40:38 +09:00
Jean Chalard	752996540f	Add read support for string shortcuts for makedict. Change-Id: I48ee4fc9ac703ad2a680b3cd848de91c415ea3c8	2012-03-28 20:40:08 +09:00
Jean Chalard	3bbb31f3f0	Change the format of the shortcuts in the binary dict. This only includes the write part of the change. The read part is coming in a different commit. Change-Id: Iabe7af6cd134462dc19245f5400719920ed31c8f	2012-03-28 20:24:07 +09:00
Tom Ouyang	b163f91621	Merge "Add support for updating and adding bigrams to existing nodes."	2012-03-23 05:57:55 -07:00
Tom Ouyang	7cfe20efbe	Add support for updating and adding bigrams to existing nodes. Bug: 6188977 Change-Id: I48aca8ba199247d73395ab13b9d1976f4e739208	2012-03-23 21:52:39 +09:00
Ken Wakasa	066866954a	Add a missing comparison in Word.equals() Follow up to I94e2e29c bug: 6209651 Change-Id: Iff2daca8c2678e2d1796f98d6db738f109e3d03f	2012-03-23 14:41:16 +09:00
Ken Wakasa	9f0ea52a5d	Add missing Word.hashCode() Some cleanups too. bug: 6209651 Change-Id: I94e2e29c92e90e554e4952d277d590e093766c4f	2012-03-23 13:11:39 +09:00
Ken Wakasa	2aa02b84a4	Revive the Makefile for makedict Follow up to I4d2ef504. Address a compiler warning and a small optimization as well. bug: 6188977 bug: 6209651 Change-Id: Ibc9da51d48ebf0b8815ad0bb2f697242970ba8f7	2012-03-22 11:55:18 +09:00
Tom Ouyang	e276c2401e	Move makedict to LatinIME android keyboard. Bug: 6188977 Change-Id: I4d2ef504bb983abbda3cb52ee450cb46f58d95cf	2012-03-21 19:30:26 +09:00
satok	905670bd87	Add a dummy file and package for make dict Change-Id: I195fd42f2a773bcc6fab0a61336a1c15d97902bb	2012-03-19 15:26:13 +09:00

239 commits
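
Commit a411595b16 above describes calling codePointAt() with a code point index where a char index was expected. The sketch below only illustrates that class of mistake using the standard java.lang.String API; it is not the LatinIME code. The class name CodePointIndexDemo and the helper codePointAtCodePointIndex are hypothetical names made up for this example.

```java
public final class CodePointIndexDemo {
    private CodePointIndexDemo() {}

    // Hypothetical helper: returns the code point found at the given
    // code point index, converting it to a char index first.
    public static int codePointAtCodePointIndex(final String s, final int codePointIndex) {
        // offsetByCodePoints maps a code point offset to a char (UTF-16 unit) index.
        final int charIndex = s.offsetByCodePoints(0, codePointIndex);
        return s.codePointAt(charIndex);
    }

    public static void main(final String[] args) {
        // U+1F600 lies outside the BMP, so it takes two chars (a surrogate pair).
        // chars:       'a', 0xD83D, 0xDE00, 'b'
        // code points: 'a', U+1F600, 'b'
        final String word = "a\uD83D\uDE00b";

        // Mistake: passing a code point index (2) straight to codePointAt(),
        // which expects a char index. This reads the lone low surrogate.
        final int wrong = word.codePointAt(2);

        // Conversion first: code point index 2 is char index 3, which is 'b'.
        final int right = codePointAtCodePointIndex(word, 2);

        System.out.println(Integer.toHexString(wrong)); // de00
        System.out.println(Integer.toHexString(right)); // 62
    }
}
```

For words made only of BMP characters the two indices coincide, which is one reason a bug of this kind can go unnoticed until a surrogate pair appears in a word.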