Ehud Reiter's Blog

Ehud's thoughts about Natural Language Generation. Also see my book on NLG.

Year: 2018

Uncategorized

How Would I Automatically Evaluate NLG Systems?

Jul 25, 2018 | Aug 7, 2018 | ehudreiter | 1 Comment

Some musings on principled and theoretically sound techniques for automatically evaluating NLG systems.

Uncategorized

How to Validate Metrics

Jul 10, 2018 | Aug 7, 2018 | ehudreiter | 4 Comments

My advice on how to perform a high-quality validation study, which assesses whether a metric (such as BLEU) correlates well with human evaluations.

Uncategorized

Why doesn't BLEU work for NLG?

Jul 2, 2018 | Aug 7, 2018 | ehudreiter | 8 Comments

BLEU works much better for MT systems than for NLG systems. In this blog I present some speculations as to why this is the case.

Uncategorized

BLEU in Different Languages: Don't use it for German

Jun 20, 2018 | Aug 7, 2018 | ehudreiter | 1 Comment

My structured survey of BLEU suggests that BLEU-human correlations are worse in German than in many other languages. But there are many caveats, so we need to be cautious in interpreting this result.

Uncategorized

BLEU-Human Correlation is Increasing: What does this Mean?

Jun 14, 2018 | Aug 7, 2018 | ehudreiter | 6 Comments

The correlation between BLEU and human evaluations of MT systems seems to be increasing over time. Since BLEU has not changed, how is this possible, and what does it mean?

Uncategorized

Many Papers on Machine Learning in NLP are Scientifically Dubious

Jun 6, 2018 | ehudreiter | 1 Comment

In response to a previous blog, many people expressed concerns to me about the quality of many papers they saw on ML in NLP. I summarise some of these concerns, which are worrying.

Uncategorized

Learning does not require evaluation metrics

May 30, 2018 | ehudreiter | 3 Comments

I was recently asked if machine learning requires evaluation metrics. The answer is no, and the fact that people are asking such questions suggests that some newcomers to the field may have a limited perspective on NLP research methodology.

Uncategorized

How Should Different NLG Components Add Value?

May 17, 2018 | ehudreiter

Some comments on how different components in the NLG pipeline can “add value” by contributing to the ultimate goal of generating texts that are easy for people to read and understand.

Uncategorized

Challenges of Surface Realisation

Apr 30, 2018 | ehudreiter | 1 Comment

I think surface realisation becomes especially challenging when syntax depends on semantics or pragmatics. From an engineering perspective, handling phenomena that only occur in a few languages can be painful.

Uncategorized

“Will I Pass my PhD Viva”

Apr 16, 2018 | ehudreiter | 2 Comments

Many students get stressed about their PhD viva (oral exam) even though they are very unlikely to fail. I present some rules and a flowchart to suggest when there is real cause for concern, and when there is not.


News: I am likely to retire in summer 2026. Looking for interesting things to do afterwards.
