Skip to content

Ehud Reiter's Blog

Ehud's thoughts about Natural Language Generation. Also see my book on NLG.

  • Home
  • Blog Index
  • About
  • What is NLG
  • Publications
  • Resources
  • University
  • Book
  • Contact

Tag: data set

academics

Challenging NLG datasets and tasks

Mar 4, 2021 ehudreiter5 Comments

I would like neural NLG researchers to focus on more challenging datasets, and make some suggestions.

Uncategorized

Do people “cheat” by overfitting test data

Feb 6, 2020Feb 11, 2020 ehudreiter2 Comments

NLP in 2020 is dominated by papers which report small improvements in state-of-art. I suspect that a lot of these improvements are due to overfitting test data, not to genuine scientific advances.

  • LinkedIn
  • Twitter

News: I am likely to retire in summer 2026. Looking for interesting things to do afterwards.

Top Posts & Pages

  • What LLMs cannot do
  • Generated Texts Must Be Accurate!
  • Publish in Journals!
  • Is building neural NLG faster than rules NLG? No one knows, but I suspect not.
  • We need better LLM benchmarks
  • Do LLMs cheat on benchmarks
  • Do We Encourage Researchers to Use Inappropriate Data Sets?
  • Google: Please Stop Telling Lies About Me
  • We Need Robust Ways to Select Content of NLG Texts
  • Benchmarks distract us from what matters
Blog at WordPress.com.
  • Subscribe Subscribed
    • Ehud Reiter's Blog
    • Join 100 other subscribers.
    • Already have a WordPress.com account? Log in now.
    • Ehud Reiter's Blog
    • Subscribe Subscribed
    • Sign up
    • Log in
    • Report this content
    • View site in Reader
    • Manage subscriptions
    • Collapse this bar