other

Bayesian vs Neural Networks

Jul 5, 2021Jul 5, 2021 ehudreiter1 Comment

Why would anyone use a Bayesian model instead of a neural model in clinical decision support? Perhaps because the Bayesian model is much easier to justify and adapt to a changing world. Explaining Bayesian models is also a really interesting research challenge, and one of my colleagues has funding for a PhD student in this area.

Uncategorized

Is building neural NLG faster than rules NLG? No one knows, but I suspect not.

May 11, 2020May 11, 2020 ehudreiter6 Comments

A colleague asked me if it was true that building neural NLG systems was faster than building rule-based NLG systems. The answer is that we dont know, because we dont have good data on this question. However the weak evidence we do have suggests that building rules-based NLG is no slower and may be faster than building neural NLG, at least for data-to-text systems.

Uncategorized

Lets Use ML for Insights!

Mar 20, 2020 ehudreiterLeave a comment

I’d love to see more people using machine learning to provide insights about NLG problems and related linguistic issues. I personally think this is much more useful than tweaking models to show a 1% increase in state-of-art in a very artificial context.

Uncategorized

Do people “cheat” by overfitting test data

Feb 6, 2020Feb 11, 2020 ehudreiter2 Comments

NLP in 2020 is dominated by papers which report small improvements in state-of-art. I suspect that a lot of these improvements are due to overfitting test data, not to genuine scientific advances.

Uncategorized

Amateurs focus on models; professionals focus on data

Jan 14, 2020 ehudreiter2 Comments

There is a military saying that “amateurs discuss tactics, professionals discuss logistics”. Similarly I think AI professionals should focus on data more than models. I suggest four simple initial questions to ask about your data if you want to build an ML system.

Uncategorized

Lessons from 25 Years of Information Extraction

Jan 2, 2020Jan 2, 2020 ehudreiter1 Comment

I really liked Grishman’s recent paper on 25 years of research in information extraction, and summarise a few of the key insights here, about relative progress in different areas of NLP, reluctance of researchers to use complex evaluation techniques, and corpus creation vs rule-writing.

Uncategorized

ML is Used More if it Does Not Limit Control

Aug 15, 2019 ehudreiterLeave a comment

When we try to use ML in commercial NLG contexts, one of the challenges is that NLG developers want to be able to customise, configure, and control their systems. So we need ML approaches which do not stop devs from configuring things they are likely to want to change.

Uncategorized

Mistakes in Evaluating ML

May 15, 2019May 15, 2019 ehudreiter2 Comments

Unfortunately, I see many students (and indeed other people) make some basic mistakes when evaluating machine learning, for classifiers as well as NLG.

Uncategorized

Skills Required to Use Different NLG Technologies

Apr 4, 2019Apr 4, 2019 ehudreiter1 Comment

An important difference between different approaches to building NLG systems is the skills needed to use these approaches to build systems. Machine learning requires the most skills, smart templating the least, and simplenlg-type programmatic approaches are in the middle.

Uncategorized

Does Deep Learning Prefer Readability over Accuracy?

Jan 8, 2019Feb 1, 2019 ehudreiter1 Comment

In both NLG and MT contexts, deep learning approaches can result in texts which are fluent and readable but also incorrect and misleading. This is problematical if accuracy is more important than readability, as is the case in most NLG contexts.

Ehud Reiter's Blog

Ehud's thoughts about Natural Language Generation. Also see my book on NLG.

Tag: Machine Learning

Bayesian vs Neural Networks

Is building neural NLG faster than rules NLG? No one knows, but I suspect not.

Lets Use ML for Insights!

Do people “cheat” by overfitting test data

Amateurs focus on models; professionals focus on data

Lessons from 25 Years of Information Extraction

ML is Used More if it Does Not Limit Control

Mistakes in Evaluating ML

Skills Required to Use Different NLG Technologies

Does Deep Learning Prefer Readability over Accuracy?