Making impactful research more rigorous — and rigorous research more impactful
A talk for university researchers
The Unjournal (unjournal.org) — a grant-funded nonprofit that pays experts to publicly evaluate and rate research, and assesses Pivotal Questions for stakeholders.
We aim to make impactful research more rigorous, and academic work more useful.
We support open science, open access & transparency.
We work to improve peer-review — aligning research incentives with truth-seeking and social value.
Focus: economics, policy & quantitative social science with global-impact potential · unjournal.pubpub.org
Journals do real things — curation, dissemination, community, a trusted signal.
But we already disseminate ourselves — working papers, arXiv, SSRN, dynamic docs — and a 17th-century filter still governs careers and credibility.
So the scarce, valuable part is increasingly the evaluation — the expert judgement — and we throw most of it away.
The average economics paper goes to 3–4 journals before it’s placed.
Reviewer time alone is a back-of-envelope ~$150M/year in economics.
But the biggest cost is authors’ time — reformatting, resubmitting, journal-shopping, strategising instead of just improving the work.
…e.g. “spin it as a hamburgers-economics paper for the American Hamburger Journal”
“Playing this game diverts us from producing the most credible, useful research.”
Journals take one format: ~30 static pages.
Publication says “done” — so we slice off the next paper.
Little room to improve, correct, or build in place.
Evaluation needn’t be chained to a 30-page PDF:
Any format — dynamic, interactive, replicable documents.
Improve in place → then ask for further evaluation.
Open evaluation feeds open science & replication.
A non-profit commissioning open evaluation of publicly-hosted research with potential for global impact.
We commission and pay for expert evaluation — and authors can still publish in a journal too.
Multiple evaluations, structured ratings, and an author response — all public, with DOIs.
Credible, citable peer review — not tied to any journal’s accept/reject.
unjournal.org · unjournal.pubpub.org · Funders: Survival & Flourishing Fund, Long-Term Future Fund, EA Infrastructure Fund
We don’t accept/reject or assign a tier; so we benchmark instead. Two halves, equally important:
The substance
The structured ratings
All public, citable, and comparable — see the evaluator interface → · ↓ the actual instrument
Some research users want more than “which journal published it” — and they want it faster:
Funders & research users who need evidence to act — e.g. Coefficient Giving, Survival & Flourishing Fund.
They want credible expert judgment, transparent reasoning, quantified beliefs & uncertainty.
What they’re really after: decision-relevance and value of information.
Many don’t yet know which questions matter most — or what the evidence already says. Helping surface and answer those is part of the job (prioritisation; Pivotal Questions).
And researchers keep their independence: we evaluate and prioritise existing public work — we don’t commission, own, or steer the research itself.
A different demand signal from the journal system.
↓ “but doesn’t this need everyone to move at once?”
Academics ~broadly agree open evaluation is better — but can’t move first alone.
Funding & grantmaker incentives can tip the balance.
We work to be highly visible — so evaluations & ratings are seen before conventional reviewers weigh in.
Building a bridge, not asking you to jump off one: Fear of Standing Out → Fear of Missing Out.
source:unjournal in Google Scholar — evaluations indexed and discoverable alongside the papers themselves.
One round of public evaluation → a credible output now.
A publicly citable signal after one round.
A traditional journal: 6+ months, R&R at best — then maybe accepted after substantial revisions.
For fast-moving topics, that lag means missing the decision window entirely.
AI capabilities · AI’s impact on labour markets · policy windows
Target ~2–3 months · prioritisation → published package
Versus a traditional economics journal: ~1–3 years (often 24+ months to acceptance).
Who evaluates? Domain experts from our 180+ pool (½ hold doctorates, ~40 professors), matched to each paper — paid, named or anonymous. ▶ 2-min explainer · ↓ full workflow & video · who’s behind it (§3)
▶ Watch the 2-minute explainer on YouTube
Prioritisation is triage, not evaluation
First question: will better evidence here change real decisions?
We do prioritise influential, widely-read work — but we don’t chase the merely clever.
↓ how the triage actually runs
57 evaluation packages on PubPub
100+ expert evaluations
180+ evaluators (120+ PhDs, ~40 profs)
~$450 avg evaluator payment
1,000+ structured ratings recorded
40+ field specialists
ISSN 3071-2173 · 501(c)(3) · DOIs
2024–25 Evaluator Prize · 1st Evaluation of “Water Treatment & Child Mortality: a meta-analysis”
Across tracked evaluations
A management team (7) and advisory board (16) govern the process and standards; field specialist team members (~60) source and prioritize research:
Each paper is evaluated by ~2 domain experts, often matched from our evaluator pool:
180+ evaluators in the pool
½+ are economists
½+ hold doctorates
40+ field specialists · 8 areas
↓ the advisory board
Development economics Anirudh Tagat · Ryan Briggs · Michael Wiebe · Nathan Fiala · Emmanuel Orkoh · Robert Kubinec · Masyhur Hilmy · Wayne Sandholtz · Lee Crawfurd · Yannick Dupraz · Leena Bhattacharya · William Seitz
Global health & well-being Jake Eaton · Rosie Bettle · Charlotte Lane · Shobhit Kulshreshtha · Jonah Goldberg · Valentin Klotzbücher · Priya Lall · Francesco Ramponi · Sarah Reynolds
Economics, welfare & governance David Reinstein · Julian Jamison · Tabaré Capitán · Joel Christoph · Andrei Potlogea · Greg Sasso · Brian Weber · Daniel Horn · Moritz Hennecke · Seth Benzell
Psychology, behavioral science, attitudes Hansika Kapoor · Jonathan Berman · Mattie Toma · Carina Ines Hausladen · Hannah Metzler
Innovation, meta-science, social impact of technology Daniela Cialfi · Jordan Dworkin · Kris Gulati · Andrew Kao · Gavin Taylor · Gary McDowell
Environmental economics Tanya O’Garra · Ben Balmford
Animal welfare (markets, attitudes) Josh Tasoff · Kevin Kuruc · Florian Habermacher · Nicolas Treich · Ash Mader · Brinda Poojary
Catastrophic risks, AI governance & safety David Manheim · Anca Hanea · Alexander Herwix · Tristan Williams
From single papers → identifying stakeholders’ specific ‘operationalised’ questions that matter:
What would change key decisions — and what research evidence informs it?
What do experts believe now, and how uncertain are they?
Researchers + practitioners + stakeholders — incl. Founders Pledge, Animal Charity Evaluators; participants from Coefficient Giving & more.
Beliefs on our platforms · overview · workshops: cultured-meat · wellbeing
Our field specialists and evaluators are spread across many universities — there’s a good chance some are already in your department.
Some are field specialists who help us prioritise and recruit.
Others sit in our 180+ evaluator pool, matched to papers in their area.
And we’ve evaluated work co-authored by university economists like those in this room.
↓ where a department’s strengths might fit
| A department strength | …maps onto Unjournal work |
|---|---|
| Behavioural & experimental | belief elicitation; the Pivotal-Questions forecasts |
| Environmental & climate | natural-capital valuation; our climate & animal-welfare evaluations |
| Health, wellbeing & development | cost-effectiveness, WELLBYs, the RCTs we prioritise |
| Econometrics & methods | meta-science; calibrating our ratings |
| Open research & reproducibility | public disciplinary judgement alongside repositories & compliance |
| Development | the field RCTs we prioritise |
| AI, technological change & labour | AI’s societal & labour-market impact — a fast-growing Unjournal priority |
Staff · postdocs · advanced PhDs
Paid (~$450 avg) — for work you partly do already.
Faster and more visible than a report that vanishes into a journal.
Named or anonymous; counts as service; citable with a DOI.
A referee report you’re proud of becomes a public, citable output.
Authors
Submit a working paper → credible public evaluations and ratings.
The journal path stays open — get feedback and a public signal before it resolves.
Or suggest others’ high-impact work — anonymously if you like.
A public commitment — and a signal.
“I’m willing to have this evaluated openly.”
Requesting open evaluation can itself carry information — strong-but-under-credited work has the most to gain.
Most valuable when your work is strong but under-credited — or sitting just below the bar.
If committing to open evaluation becomes a positive signal, you’ll want in early.
Less of a clear win when the work already clears the bar and it’s a sensitive career moment.
Timing concerns? Talk to us — we can embargo or schedule.
Full “model” (v. preliminary, ~Fable-generated with human feedback): unjournal-reluctance-note.netlify.app
See what economists, funders and practitioners actually care about — a methodological conversation that sharpens your own work.
Do real peer review, and get feedback on your evaluation from us, often from the authors.
Gain visibility within a network of funders, grantmakers and impact-minded researchers.
And potential RA / fellowship roles: evaluation, meta-research, Pivotal-Questions support.
Evaluation packages & prioritisation → a vetted evidence base to build on, teach, cite, and discuss.
Pivotal Questions & workshops → framing for agendas, grants, collaborations, research-impact cases.
Public evaluations → possible research-assessment, grant, or esteem evidence (e.g. the UK’s REF).
The ratings dataset → meta-analysis, and field-experiment collaborations on the evaluation process itself.
Funders and nonprofits read these evaluations.
Some use them in grantmaking and methodology.
A route to feedback, uptake, and sometimes collaboration.
A way to put careful work in front of people who actually use evidence.
A strong public evaluation speaks to quality and usefulness directly — alongside what the venue signals, not only where it landed.
Multidimensional ratings with uncertainty, expert reports and discussion, an author response, a citable DOI.
For research leaders & managers — encouraging engagement signals a commitment to rigour, transparency and innovation.
And it opens the research-impact channel: our funder and practitioner network, including Pivotal Questions.
A flood of plausible AI-generated papers — some correct and useful, many not.
So more need for efficient, transparent evaluation — connected to real stakeholders and impact.
AI can help: scalable code and data checks.
But the current consensus: keep a human in the loop for the final calls.
Not “does it fit a top-5 template” — but “is it true, and does it matter?”
One exploratory pilot · ~45 papers
A “frontier” (Jan. 2026) LLM vs. our human ratings: only modest rank agreement (r ≈ 0.3).
Human–human agreement still exceeds human–LLM.
On written critiques, LLMs catch ~¾ of human concerns — but ~half their flags aren’t substantive.
Not yet a substitute — but an open question, and we’re exploring AI prioritisation, research reasoning, and alignment here.
Preliminary methods & results: llm-uj-research-eval.netlify.app/methods
What does open (Unjournal) evaluation provide?
Now: faster, useful feedback + a credible public signal, and useful inputs to practitioners and funders.
Soon: it starts to carry career value.
Eventually: it can replace much or all of what we ask the journal stamp to do.
Which of these would actually help your work?
David Reinstein · contact@unjournal.org · unjournal.org · unjournal.pubpub.org