Ehud Reiter's Blog

Ehud's thoughts about Natural Language Generation. Also see my book on NLG.

Day: May 9, 2017

You Need to Understand your Corpora! The Weathergov Example

May 9, 2017 · ehudreiter · 8 Comments

People who use corpora to build NLG systems need to understand what is in the corpora. The widely used Weathergov corpus, for example, probably contains computer-generated texts rather than human-written texts. So learning from it is essentially reverse-engineering a rule-based NLG system.
