Dutch spell checker
-
ANdre
Dutch spell checker
Hi! Will there be a larger dutch spellchecker in the update? The one that is in it now is not sufficiant enough. And what about grammar? Will there be help for that also in the future?
btw... when is the update (around) coming? I am looking forward to it very much!
greetings,
André
btw... when is the update (around) coming? I am looking forward to it very much!
greetings,
André
Re: Dutch spellchecker
Greetings--
No update to the Dutch Spellchecker is to be expected in the immediate future. The English spellchecker will be completely revised and augmented in the next release.
The spellchecker engine will also be vastly improved. It is now a proprietary engine.
If you are a registered user of Atlantis, you will receive notice of any update to Atlantis, when it is available.
If you are not a registered user of Atlantis, but want to be kept informed of any major development, you can subscribe to our newsletter at http://www.rssol.com/en/html/mailinglist.htm
Cheers
Robert
No update to the Dutch Spellchecker is to be expected in the immediate future. The English spellchecker will be completely revised and augmented in the next release.
The spellchecker engine will also be vastly improved. It is now a proprietary engine.
If you are a registered user of Atlantis, you will receive notice of any update to Atlantis, when it is available.
If you are not a registered user of Atlantis, but want to be kept informed of any major development, you can subscribe to our newsletter at http://www.rssol.com/en/html/mailinglist.htm
Cheers
Robert
Re: Dutch custom dictionary
Greetings--andre wrote: I have dutch dictionary file from another (free) wordprocesor. It is a tekst document format file. Can I use this one instead of the standard in Atlantis?
You can add as many words as you like to your custom dictionary from the Atlantis document window. If you right-click a word red-underlined by the spellchecker, you can choose "Add to Dictionary". This will add the word to your custom dictionary.
Now if you have a whole list of words to add, you can add all of them at one go. Here is how to proceed:
1. Open your Windows file manager (Windows Explorer by default).
2. Navigate to the Atlantis home folder.
3. Locate the “…\SpellCheck\Lex” subfolder under the Atlantis home folder.
4. You should find a file named “userdicdutch.tlx” or something like that. It is your Dutch custom dictionary.
5. Open this file in the Windows NotePad or in any other pure text editor.
6. Add your Dutch word list to it. Important! Your list must include only one word by paragraph line. Like so:
acetimetry
acetometry
acidimetry
actinometry
aerometry
alcoholometry
algometry
alkalimetry
alloiometry
allometry
altimetry
anemometry
7. Save the file and close NotePad.
When you create a Dutch document, all the words included in “userdicdutch.tlx” will be used by Atlantis to spellcheck your text.
Cheers
Robert
-
andre
Re: Dutch custom dictionary (2)
Greetings--andre wrote: I did what you told, but after adding the file into the user dialog for dictionaries, Atlantis is freezing. Maybe the file is too big? The file userdicdu is 1621 kb.
I don't know what you mean exactly by "after adding the file into the user dialog for dictionaries".
But if you inserted a whole file directly into the Atlantis user dictionary dialog, you probably created a corrupt user dictionary, that is a corrupt user dictionary file.
If this is what you did, I suggest that you do exactly as follows:
1. Open your Windows file manager (Windows Explorer by default).
2. Navigate to the Atlantis home folder.
3. Locate the “…\SpellCheck\Lex” subfolder under the Atlantis home folder.
4. You should find a file named “userdicdutch.tlx” or something like that. Its (default) name begins with "userdic". It is your Dutch custom dictionary.
5. Open this file in the Windows NotePad or in any other pure text editor.
6. Press Ctrl+A to select the whole content of your Dutch custom dictionary.
7. Press the Del key. This will empty your Dutch custom dictionary of all entries.
8. Add your Dutch word list to it. Above all, do not forget that your list must include only one word by paragraph line. Each line contains only one word and is separated from the next by a paragraph end mark, like so:

Note that "paragraph end" marks are created by pressing the Enter key on your keyboard.
Also note that "line break" marks are completely different from "paragraph end" marks.
"Line break" marks are created by pressing "Shift Enter" on your keyboard. A user dictionary with "Line break" marks instead of "paragraph end" marks would look like this:

Finally, note that your Dutch custom dictionary must include no other punctuation marks than "paragraph end" marks. Custom dictionaries cannot include commas, colons, semi-colons, line break marks and the like. They cannot include space characters either. They can only include "paragraph end" marks as illustrated in the first picture above.
If your file contains "line break" marks or any unauthorised formatting symbol or punctuation marks, you must remove them before adding your word list.
If your file contains "line break" marks, you should replace them with "paragraph end" marks. You can use the Atlantis Find/Replace dialog to do so (Ctrl+H). There are special buttons to insert "line break" marks and "paragraph end" marks in this dialog.
9. Save your Dutch custom dictionary and close NotePad.
Hope this helps.
Robert
-
Andre
Hi!
Thanks for your help, I did as you said inthe previous mail,but I see now that there is anonther problem in that file. Every word is on a new line, but many words are for some reason with the / symbol plus some capital letters after that. For example: Aalsmeer/DS
I could easily delete all the / with the find replace option, but is there also a way to remove all the capital letters at the end of the words? I could do it by hand, but there are more then 2000 pages...
André
Thanks for your help, I did as you said inthe previous mail,but I see now that there is anonther problem in that file. Every word is on a new line, but many words are for some reason with the / symbol plus some capital letters after that. For example: Aalsmeer/DS
I could easily delete all the / with the find replace option, but is there also a way to remove all the capital letters at the end of the words? I could do it by hand, but there are more then 2000 pages...
André
Re: Dutch custom dictionary (3)
Greetings--Andre wrote: Thanks for your help, I did as you said inthe previous mail,but I see now that there is anonther problem in that file. Every word is on a new line, but many words are for some reason with the / symbol plus some capital letters after that. For example: Aalsmeer/DS
I could easily delete all the / with the find replace option, but is there also a way to remove all the capital letters at the end of the words? I could do it by hand, but there are more then 2000 pages...
It all depends on how many different strings you have to remove.
If there are only a few, you could simply use the NotePad Find/Replace dialog (Ctrl+H).
For example, let's suppose that you want to remove all "/DS" strings from your word list. Here is how to proceed:
1. First open your word list file in NotePad and press Ctrl+H.
2. In the Find box, enter "/DS" (without the quotes of course).
3. Make sure that the Replace box is empty.
4. Press "Replace All" to run the Find/Replace operation.
Proceed in the same way with any other string that you want to remove. Simply repeat steps 1 to 4 above but enter a different string in the Find box.
Now let's suppose that you want to remove all unnecessary space characters from your word list. Here is how to proceed:
1. Open your word list file in NotePad and press Ctrl+H.
2. Click in the Find box, then press the spacebar on your keyboard.
3. Make sure that the Replace box is empty.
4. Press "Replace All" to run the Find/Replace operation.
Note that you might also have to enter double space characters if your word list contains such doubles.
This should remove all space characters from the file.
Hope this helps.
Robert
Even if your custom dictionary is properly composed, the main problem is that the spellcheck engine of the current public version of Atlantis is unable to process such large custom dictionaries. The spellcheck engine of the incoming version of Atlantis will have no problems with opening such dictionaries.
But if you have ideas on improving the Dutch spellchecker, we could get in touch when the new version is available for betatesting.
But if you have ideas on improving the Dutch spellchecker, we could get in touch when the new version is available for betatesting.
-
andre
Hi!
The free Dutch spell checker I tried to use to put into the userdic, came from the OpenOffice word processor. The spellchecker is created by MySpell http://packages.debian.org/unstable/text/myspell-nl and is also used by Mozilla (the web browser and webpage creator).
After installing that prg I saw that there was a txtfile for the Dutch spelling, but it was mixed with all kind of codes and for Atlantis, for now, is the file too big to use. This file contains 120975 words.
I also found the prg Freespell http://hcidesign.com/freespell/
it is a little spellchecker, where I also found a Dutch txtfile, this file contains 199272 words and is free from strange not wanted codes. It is exactly the way you described above as Atlantis needs it. But this file is even bigger then the other one (2434 kb) and so not usable right now in Atlantis.
From both, I don't know if the quality is good, but it could be an improvement for the Dutch checker.
Hope I can use it in the new version
greetings,
André
The free Dutch spell checker I tried to use to put into the userdic, came from the OpenOffice word processor. The spellchecker is created by MySpell http://packages.debian.org/unstable/text/myspell-nl and is also used by Mozilla (the web browser and webpage creator).
After installing that prg I saw that there was a txtfile for the Dutch spelling, but it was mixed with all kind of codes and for Atlantis, for now, is the file too big to use. This file contains 120975 words.
I also found the prg Freespell http://hcidesign.com/freespell/
it is a little spellchecker, where I also found a Dutch txtfile, this file contains 199272 words and is free from strange not wanted codes. It is exactly the way you described above as Atlantis needs it. But this file is even bigger then the other one (2434 kb) and so not usable right now in Atlantis.
From both, I don't know if the quality is good, but it could be an improvement for the Dutch checker.
Hope I can use it in the new version
greetings,
André
-
andre
Hi,
I just tried something else to get a wordlist, maybe it is to naive to create a good list but it seems to work. I copied all the text out of a PDF novel file, put it into Atlantis, and replaced all the spaces by a paragraph ending so I got a long list of words. After that, I put the wordlist in alphabetical order and removed commas, periods etc. I have a clean list of words now that I can use in the userdic. The only thing is now, that there are a lot of double words in the list. Is there a way to get these out quickly?
I would love to help with the Dutch dictionary option in a beta version!
Best wishes,
André
I just tried something else to get a wordlist, maybe it is to naive to create a good list but it seems to work. I copied all the text out of a PDF novel file, put it into Atlantis, and replaced all the spaces by a paragraph ending so I got a long list of words. After that, I put the wordlist in alphabetical order and removed commas, periods etc. I have a clean list of words now that I can use in the userdic. The only thing is now, that there are a lot of double words in the list. Is there a way to get these out quickly?
I would love to help with the Dutch dictionary option in a beta version!
Best wishes,
André
Re: Dutch custom dictionary (4)
Greetings--andre wrote: The only thing is now, that there are a lot of double words in the list. Is there a way to get these out quickly?
The next version of Atlantis has an option to remove duplicate items when sorting paragraphs.
Until this next version of Atlantis is released, you can use a free utility found on the Web. Here is how to proceed:
1. Go to the following Web page:
http://javascript.internet.com/miscella ... ail-l.html
2. Press the "Highlight All" button on that page.
3. Right-click the highlighted source file, then choose "Copy".
4. Paste the Windows clipboard contents into a new NotePad document.
5. Save this document as "Remove Duplicates.htm". Or use any other convenient name. But the extension must be the ".HTM" extension.
6. Close NotePad.
7. Double-click "Remove Duplicates.htm" in Windows Explorer to open it in your default Internet browser. You should get this:

8. Paste your list into the above browser page window.
9. Press the "De-Dupe list!" button. This will remove all duplicate entries from your original list.
10. Copy the new list from the browser page.
11. Paste the clipboard contents into a new NotePad document and save the list as a ".TXT" file.
Cheers
Robert
-
andre
-
andre
Please click here to subscribe to the betatesting news.