publications
Publications by category in reverse chronological order. Generated by jekyll-scholar.
2025
- Formalising Human-in-the-Loop: Computational Reductions, Failure Modes, and Legal-Moral Responsibility. arXiv preprint arXiv:2505.10426, 2025
- Framing the Game: How Context Shapes LLM Decision-Making. arXiv preprint arXiv:2503.04840, 2025
2024
- Evaluating AI Evaluation: Perils and Prospects. Jun 2024. eprint: 2407.09221
- Conversational Complexity for Assessing Risk in Large Language Models. Sep 2024. arXiv:2409.01247 [cs, math]
- The Animal-AI Environment: A Virtual Laboratory For Comparative Cognition and Artificial Intelligence Research. Oct 2024. arXiv:2312.11414 [cs]
2023
- Inferring Capabilities from Task Performance with Bayesian Triangulation. Sep 2023. arXiv:2309.11975 [cs]
- Your Prompt is My Command: On Assessing the Human-Centred Generality of Multimodal Models. Journal of Artificial Intelligence Research, Sep 2023
- Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models. Transactions on Machine Learning Research, Sep 2023
- An International Consortium for Evaluations of Societal-Scale Risks from Advanced AI. Nov 2023. arXiv:2310.14455 [cs]
2022
- How Sure to Be Safe? Difficulty, Confidence and Negative Side Effects. In NeurIPS ML Safety Workshop, Nov 2022
- Not a Number: Identifying Instance Features for Capability-Oriented Evaluation. In Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, IJCAI-22, Jul 2022
- Evaluating object permanence in embodied agents using the animal-AI environment. EBeM’22: Workshop on AI Evaluation Beyond Metrics, July 25, 2022, Vienna, Austria, Jul 2022. Publisher: CEUR Workshop Proceedings
- How general-purpose is a language model? usefulness and safety with human prompters in the wild. Jul 2022. Issue: 5
2021
- Latent Property State Abstraction For Reinforcement Learning. In Proceedings of the AAMAS Workshop on Adaptive Learning Agents (ALA), Jul 2021
2020
- Automating abstraction for potential-based reward shaping. Dec 2020. Publisher: University of York
- Uniform State Abstraction for Reinforcement Learning. In 24th European Conference on Artificial Intelligence, Dec 2020