Yes, there are some: there is igerman98_all.xml.bz2 - German lemma list in XML format based on ispell word list from Niels Ott's BananaSplit.
Or, you could generate the list from your text with the TreeTagger tool. For each text token it assigns a tag, that tells us this is a noun, this is an articel.

But the problem with the dictionaries is: Most dictionaries include also the compound words, while we don't want them in our dictionary. The splitter needs the words in its very basic form. If you have compound words in your dictionary, the splitter doesn't break them up further ...

So I decided to create myself a list. It was easy. I started from the 500 most used search terms on my website. And then i splitted them manually. It was easy and did not take long.

Reply

The content of this field is kept private and will not be shown publicly.
  • Internal paths in double quotes, written as "internal:node/99", for example, are replaced with the appropriate absolute URL or relative path.
  • Web page addresses and e-mail addresses turn into links automatically.
  • Allowed HTML tags: <a> <em> <i> <strong> <cite> <code> <ul> <ol> <li> <dl> <dt> <dd> <p> <h3> <h4> <b> <object> <param> <embed> <img> <hr> <table> <tr> <td>
  • Lines and paragraphs break automatically.
  • You may use [acidfree:xx] tags to display acidfree videos or images inline.

More information about formatting options

CAPTCHA
This question is for testing whether you are a human visitor and to prevent automated spam submissions.
1 + 18 =
Solve this simple math problem and enter the result. E.g. for 1+3, enter 4.