Over-used phrases and words

Request new features or suggest modifications to existing features of Atlantis.
Post Reply
Eduardo
Posts: 110
Joined: Sun Feb 01, 2004 10:49 am

Over-used phrases and words

Post by Eduardo »

Atlantis would be perfect for writers and journalists with this feature implemented, which would search the document for over-used phrases and words. There's one program that has that implemented. Please take a look for your reference.

Thanks.

Eduardo

The program is called PageFour.

http://www.softwareforwriting.com/
User avatar
admin
Site Admin
Posts: 2926
Joined: Wed Jun 05, 2002 10:48 pm
Contact:

Post by admin »

The "overused" feature of PageFour is not of much help to writers because it does not detect "problems" within the document text. When I have 1,000 words in my document, and, let's say, 10 instances of a given word are present in this document, this does not mean at all that this given word is "overused". If there is only one instance of the given word per each 100 words in my document, this word is not "overused" even if it is the most frequently used word in this document. But in the same time this document might include only 3 instances of another word. This word would be definitely "overused" if all the 3 instances were present within the same sentence.

So only counting words or phrases is of no use. Word dispersion within the document is a key criteria for detecting problems with "overused" words.

Atlantis will have a feature for detecting "really overused" words.
Eduardo
Posts: 110
Joined: Sun Feb 01, 2004 10:49 am

Post by Eduardo »

Thank you for your reply.

In fact, the search for over-used words is more effective within certain range, specially a paragraph. In short, I agree with you totally. Such a feature would be a heavens gift for me and others. I will be extremely happy when this feature is implemented in Atlantis.

Take care.

E.
andygrunt
Posts: 230
Joined: Mon Jul 23, 2007 12:35 pm

Post by andygrunt »

admin wrote:Atlantis will have a feature for detecting "really overused" words.
If you're working on this feature, I have a request. If you've not already thought to include it, could you add the ability to search for closely related words as well as duplicates?

The way I write, I tend to do a first pass which includes tons of mistakes, missing words, repeated words, very long sentences, bad grammar - the lot. I do this in order to just do a brain dump and get everything out of my head as quickly as possible. I've found that if I don't do this, I tend to forget the initial good ideas I had when first thinking about what I was writing so I want to get it all down asap. After this, I go over and over it weeding out the mistakes and filling in the gaps.

This approach suits me well but it does mean my writing's full of mistakes right from the start that I have to spot and correct later. Unfortunately, I seem to be afflicted with a 'similar words' blindness and often have duplicated words and closely related words very close to each other. Sometimes, I can be reading some text I've written for the twelfth time before I spot one of these duplicates (very often within the same sentence). For some reason these particular type of mistake is somewhat camouflaged to me and I shudder to think how often I've not spotted them in the past.

By 'closely related' I mean having words of a similar sound (though not necessarily of a similar meaning) appear close together. For example:

necessarily & necessary & necessity
occasion & occasionally
responsible & responsibility & responsively & response
feel & feeling
exclusively & exclusive & exclusivity & exclude
ask & asking & asked
create & creatively & creativity
successfully & successful
opportunity & opportune
weigh & weight & weighing
avail & available & availability
quality & qualities
realise & realised & realising & real & reality
colour & colouring
and so on...

So, I'd like your new feature to also point out these related words for me. Simply highlighting the pairs (or more) of words in the same colour will be enough but perhaps it might also take proximity into account, allowing the user to define how close the words need to be to each other to be thought of as a problem. For example if 'occasionally' ended one sentence and started the next, it would be a worse case than it starting and ending the same paragraph as fewer words would come between them.

I suspect that a simple search for letter groups would be enough for most instances (such as 'occasion' and 'occasionally') but perhaps other related words wouldn't be that simple and would warrant a list of their own that would be checked (like a users own dictionary). This naturally leads to the idea of the user being able to easily add their own related words to the list for checking.

Examples include:

I'm & I am
your & you're & you are
feel & felt
weigh & weight & weighing & weighted & wait & waiting & waited
decide & decision
their & there & they're
and so on...
User avatar
admin
Site Admin
Posts: 2926
Joined: Wed Jun 05, 2002 10:48 pm
Contact:

Post by admin »

Yes, you are absolutely right. This feature should be able to detect pairs of "similar words" as well as it should take into account the distance between "similar words" (the minimal allowed distance should be a user-defined setting). There should be also support for custom (user-defined) groups of "similar words".
andygrunt
Posts: 230
Joined: Mon Jul 23, 2007 12:35 pm

Post by andygrunt »

I had a thought about a useful addition to the new 'multiple occurrences of words' feature you're working on.

It would be very handy if Atlantis was able to search for and highlight clichés in my text. Personally, I find I often let clichés slip through the net just because they're so familiar. In fact, you might say I can't see the wood for the trees when it comes to spotting clichés.

I would expect such a feature to be relatively easy to implement as it's fundamentally only a slightly different 'search' function. However, rather than searching for a word or string provided by the user, it would have to search for any strings that match (or a similar to?) a database of predefined phrases. I don't know if it's possible to search an existing website of clichés (such as http://www.clichesite.com) or have to create your own. Personally, even if the facility was there but required me to create my own text file of phrases (clichés) to search for, that would be very useful.

What do you think?
andygrunt
Posts: 230
Joined: Mon Jul 23, 2007 12:35 pm

Post by andygrunt »

Hi,

Six months after you announced you were working on this feature, can you give an estimate as to when it will be available?

Thanks.
User avatar
admin
Site Admin
Posts: 2926
Joined: Wed Jun 05, 2002 10:48 pm
Contact:

Post by admin »

Sorry, Andy, but we never announced that we were working on this feature as well as I am not sure regarding "six months". I said that this feature will be available in the next minor release of Atlantis Word Processor. But we actually are now working on it. Sorry, no definite date yet. But it is quite probable that the new minor release of Atlantis will be available this summer. Please be patient.
andygrunt
Posts: 230
Joined: Mon Jul 23, 2007 12:35 pm

Post by andygrunt »

Sorry, yep - 5 months (your original reply in this thread is dated Jan 15 2008). Thanks for the reply and I look forward to the next release.
andygrunt
Posts: 230
Joined: Mon Jul 23, 2007 12:35 pm

Post by andygrunt »

For anyone following this thread that doesn't check back to the forum regularly...

There's now a Beta version available that includes this feature - give it a try :D
Post Reply