Skip to content

Ehud Reiter's Blog

Ehud's thoughts about Natural Language Generation. Also see my book on NLG.

  • Home
  • Blog Index
  • About
  • What is NLG
  • Publications
  • Resources
  • University
  • Book
  • Contact

Day: May 9, 2017

Uncategorized

You Need to Understand your Corpora! The Weathergov Example

May 9, 2017May 9, 2017 ehudreiter8 Comments

People who use corpora to build NLG systems need to understand what is in the corpora. The widely used Weathergov corpus, for example, probably contains computer-generated texts rather than human-written texts. So learning from it is essentially reverse-engineering a rule-based NLG system.

  • LinkedIn
  • Twitter

News: I am likely to retire in summer 2026. Looking for interesting things to do afterwards.

Top Posts & Pages

  • What LLMs cannot do
  • Retirement Plans: Travel and some academics
  • Exercise: Find Problems in an Evaluation
  • Unresponsive Authors and Experimental Flaws
  • Do LLM coding benchmarks measure real-world utility?
  • Do We Encourage Researchers to Use Inappropriate Data Sets?
  • I'm very worried about data contamination
  • Types of NLG Evaluation: Which is Right for Me?
  • Does Deep Learning Prefer Readability over Accuracy?
  • Systematic Reviews in NLP
Blog at WordPress.com.
  • Subscribe Subscribed
    • Ehud Reiter's Blog
    • Join 102 other subscribers.
    • Already have a WordPress.com account? Log in now.
    • Ehud Reiter's Blog
    • Subscribe Subscribed
    • Sign up
    • Log in
    • Report this content
    • View site in Reader
    • Manage subscriptions
    • Collapse this bar