academics

Please follow the rules for ARR/ACL papers

There are a lot of things which annoy me about ARR/ACL papers. Mostly content-level issues (such as poor evaluations and Hovy’s “LLM popcorn”), which I have written about elsewhere. But I am also unhappy with writing and presentation, especially when I see authors violate ARR rules and guidelines (see CFP and formatting guidelines). I really wish that authors followed the rules, and that reviewers rejected papers which did not follow the rules! I describe some of these problems below.

Readable font size in diagrams

As I have mentioned in a previous blog, text in diagrams must be readable. The ACL formatting guidelines state that

For any text or numbers in tables and figures, whenever possible, please use the font size of the document text. As a rule of thumb, any text or numbers should be clearly readable when the paper is printed on A4 paper. Submissions that abuse the font size or spacing for figures/tables may be desk rejected.

This rule is widely violated. I remember on one occasion a student showed me a draft paper which included a figure that needed 400% zoom to be legible. The student did not see anything wrong with this, but it is not acceptable to readers.

When I review a paper, I recommend desk rejection if the paper has unreadable figures.

Key information in appendices

Appendices in ARR papers have exploded in size, I often see 10-20 pages of appendices in an ARR paper. What is even worse is that authors put key material (eg literature review or important results) into appendices. Appendices are for supplementary material, especially material which supports replication. The ARR CFP states that

For example, pre-processing decisions, model parameters, feature templates, lengthy proofs or derivations, pseudocode, sample system inputs/outputs, and other details that are necessary for the exact replication of the work described in the paper can be put into appendices.

However, reviewers are not required to consider material in appendices. If the pseudo-code, or derivations, or model specifications are an important part of the contribution, or if they are important for the reviewers to assess the technical correctness of the work, they should be a part of the main paper rather than appendices.

When I review a paper, I follow the above rules and do not look at appendices. So if the literature review or key results are in an appendix, I automatically reject the paper.

I like TACL’s rules for appendices (5 pages max for details which support replication; 3 pages max for additional tables; no free text). I wish ARR would adopt something similar. This would help authors as well as reviewers, since many authors complain to me that some reviewers expect very large and detailed appendices.

Incorrect content from LLMs

I increasingly see papers and submissions (sometimes from respected researchers at major research universities) which contains hallucinated content which is incorrect, such as non-existent references. In one case, one of the authors told me that this had nothing to do with her, the relevant section had been written by a PhD student. The ARR CFP states that

In all cases, all authors are fully responsible for the correctness of their methods, results, and writing. They should check for potential plagiarism, both of text and code

If you add your name to a paper written by a PhD student, you are responsible for the paper being correct (and ethical), including LLM-generated content! If you have concerns (or have not even read the paper in question), you should not be an author.

Needless to say, if I review a paper with hallucinated content, I will reject it. In fields such as medicine, fraudulent content can lead to disciplinary action. In fact, the person who contacted me above worked for a university which had strict rules about this kind of thing. She could have been in a lot of trouble if someone reported this to her university (I did not do this).

Reference published papers and use DOIs

Many researchers, especially students, seem to prefer to read and cite Arxiv versions of a paper, even when the paper has been published in a peer-reviewed venue. I strongly believe that this is bad practice, since published versions often include improvements required by peer reviewers (blog).

I am not aware of an ARR policy about this, but I have seen other AI venues request that citations be to published papers if these exist. For example, the author guidelines for the computer vision conference CVPR 2026 states that

To reduce confusion, whenever citing papers that initially appeared on arXiv, the authors should check whether those papers had subsequently been published in a peer-reviewed venue, and to cite those versions accordingly.

Another issue is the use of DOIs for cited papers. DOIs are becoming the standard online reference for academic papers (including Arxiv papers) in most scientific fields. I think they work well, and the ARR formatting guidelines ask people to include DOIs for cited papers

All references are required to contain DOIs of all cited works when possible, or, as a second resort, links to ACL Anthology pages.

Most ARR papers seem to include DOIs for some references but not others, probably because some sources of references (such as ACL Anthology) include DOIs but others (such as Google Scholar) may not. I do not reject papers that lack DOI, but I do encourage authors to include them, this is good pratice.

Final thoughts

One thing that worries me is that a lot of the above is driven by research culture and expectations. Authors see nothing wrong with unreadable figures, literature reviews in appendices, etc, since they read a lot of other papers that do this. Of course poor research culture impacts content as well as writing.

It is hard to change research culture (blog), but publication venues can do this if they set clear rules and enforce them. ARR has established good rules and guidelines (although of course they could be improved, eg for appendices), lets insist that they be followed!

Postscript (18 March 2026)

A few people have told me that it is difficult for researchers who are new to ARR/ACL to digest all of ARR’s rules. I appreciate this, and realise that such people tend to simply imitate what they see in published ARR paper. Since many published papers violate the ARR rules and include diagrams with miniscule font (etc), newcomers assume this is the way ARR papers are supposed to be written.

There is no easy solution, but it would help if those of us who have been in the community for a while followed the rules in our own papers and highlighted violations in our reviews, this could help newcomers understand what is expected.

3 thoughts on “Please follow the rules for ARR/ACL papers

  1. Another issue is accessibility of visualisations for readers who are colour-blind or have other visual/cognitive impairments. Authors should make sure that figures have enough contrast and ideally ensure there is some redundancy in the presentation (e.g. not just colour contrasts but also using different patterns for different bars in a bar chart).

    Liked by 1 person

  2. The trend with appendices has been frustrating for me too. My big concern is that readers may not realize that appendices are not peer-reviewed, and cite the peer-reviewed paper for a quote or mention from the not-peer-reviewed appendix. I’d prefer to either include appendices against the page limit and require review, or distribute them as supplemental material which more clearly signals that it isn’t peer-reviewed.
    Regarding incorrect references or other hallucinations, do you have any tips for detecting those? I find it’s a struggle to manage my time when reviewing.

    Like

    1. Hi, I do not explicitly look for hallucinations, but sometimes I discover them when I try to read a referenced article. There are tools that look for hallucinated refs, I don’t have much experience with them.

      Like

Leave a reply to ehudreiter Cancel reply