academics

Our 2022 Publications: NLG Evaluation, Requirements, Resources

Dec 20, 2022 ehudreiterLeave a comment

I thought I’d end 2022 with a summary of the papers written by my students and I in 2022. All of them are about requirements, resources, and/or evaluation of NLG.

evaluation

How effective is prompting?

Dec 8, 2022Dec 8, 2022 ehudreiter2 Comments

I was very impressed by a recent paper that compared prompting-based MT to MT based on trained models. Results are very interesting; prompting-based MT generates fluent texts which however have accuracy problems. Also the paper itself is an excellent example of a high-quality NLP evaluation, and I recommd it to anyone who wants to do good NLP evaluations.

academics

I dont like leaderboards

Nov 28, 2022Nov 28, 2022 ehudreiter7 Comments

I dont like academic leaderboards. Poor scientific techniques, poor data, and poor evaluation means leaderboard results may not be worth much. I also suspect that the community’s fixation on leaderboards also means less research on important topics that do not fit the leaderboard model, such as understanding user requirements.

academics

Quality assurance for academic research

Nov 9, 2022Nov 9, 2022 ehudreiter2 Comments

Quality assurance processes for academic research, notably peer review by unpaid volunteers, are very lightweight and miss many problems. Better quality assurance processes would require more resources and efforts, but would result in more trustworthy papers.

building NLG systems

Simple vs Complex Models

Oct 26, 2022Oct 26, 2022 ehudreiter1 Comment

I was very impressed by a recent talk about the power of simple white-box models in tasks such as medical diagnosis. I’d love to see more work on simple models in NLP and NLG!

building NLG systems

Summarisation datasets should contain summaries!

Oct 13, 2022Oct 13, 2022 ehudreiter5 Comments

Thge most populat datasets used in summarisation (CNN/DailyMail and XSum) do not actually contain summaries. I find this worrying. Surely the best way to make make progress on summarisation is to use actual summarisation datasets, even if these are less convenient from a “leaderboard” perspective.

other

New book on NLG?

Oct 6, 2022 ehudreiter7 Comments

I’m considering writing a book on NLG (a mere 22 years after my last one), and would welcome feedback from the community on this project.

building NLG systems

Language is diverse!

Sep 20, 2022Sep 20, 2022 ehudreiterLeave a comment

Language is diverse, and different syntax, vocabulary, document structures, etc are used in different domains and genres. NLG developers and researchers need to keep this in mind if they are trying to develop generic NLG components.

building NLG systems

Using language models to improve rule/template NLG

Sep 8, 2022 ehudreiter1 Comment

I am excited by the idea of using a neural language model to improve the output of rule/template NLG. Many academics probably regard this as a boring use of LMs (see my previous blog), but I think it could be very useful in many real world applications.

building NLG systems

Boring uses of language models

Aug 24, 2022Sep 8, 2022 ehudreiter1 Comment

There is lots of excitement and hype about “gee whiz” uses of language models in NLG, such as generating stories from prompts. However, I suspect there maybe more real-world value in using language models for more mundane tasks such as quality assurance.

Ehud Reiter's Blog

Ehud's thoughts about Natural Language Generation. Also see my book on NLG.

Author: ehudreiter

Our 2022 Publications: NLG Evaluation, Requirements, Resources

How effective is prompting?

I dont like leaderboards

Quality assurance for academic research

Simple vs Complex Models

Summarisation datasets should contain summaries!

New book on NLG?

Language is diverse!

Using language models to improve rule/template NLG

Boring uses of language models