(Room 217-219, New Orleans, December 15, 2023)

Accepted Papers

  1. TinyGSM: achieving 80% on GSM8k with one billion parameters
    Bingbin Liu, Sebastien Bubeck, Ronen Eldan, Janardhan Kulkarni, Yuanzhi Li, Anh Nguyen, Rachel Ward, Yi Zhang [Paper]

  2. MinT: Boosting Generalization in Mathematical Reasoning via Multi-View Fine-Tuning
    Zhenwen Liang, Dian Yu, Xiaoman Pan, Wenlin Yao, Qingkai Zeng, Xiangliang Zhang, Dong Yu [Paper]

  3. Continual Learning and Out of Distribution Generalization in a Systematic Reasoning Task
    Mustafa Abdool, Andrew Nam, James McClelland [Paper]

  4. MathVista: Evaluating Mathematical Reasoning of Foundation Models in Visual Contexts
    Pan Lu, Hritik Bansal, Tony Xia, Jiacheng Liu, Chunyuan Li, Hannaneh Hajishirzi, Hao Cheng, Kai-Wei Chang, Michel Galley, Jianfeng Gao [Paper]

  5. Chameleon: Plug-and-Play Compositional Reasoning with Large Language Models
    Pan Lu, Baolin Peng, Hao Cheng, Michel Galley, Kai-Wei Chang, Ying Nian Wu, Song-Chun Zhu, Jianfeng Gao [Paper]

  6. ToolDec: Syntax Error-Free and Generalizable Tool Use for LLMs via Finite-State Decoding
    Hongqiao Chen, Kexun Zhang, Lei Li, William Yang Wang [Paper]

  7. What Algorithms can Transformers Learn? A Study in Length Generalization
    Hattie Zhou, Arwen Bradley, Etai Littwin, Noam Razin, Omid Saremi, Joshua Susskind, Samy Bengio, Preetum Nakkiran [Paper]

  8. Augmenting Large Language Models with Symbolic Rule Learning for Robust Numerical Reasoning
    Hadeel Al-Negheimish, Pranava Madhyastha, Alessandra Russo [Paper]

  9. Llemma: An Open Language Model For Mathematics
    Zhangir Azerbayev, Hailey Schoelkopf, Keiran Paster, Marco Dos Santos, Stephen McAleer, Albert Jiang, Jia Deng, Stella Biderman, Sean Welleck [Paper]

  10. SCIBENCH: Evaluating College-Level Scientific Problem-Solving Abilities of Large Language Models
    Xiaoxuan Wang, Ziniu Hu, Pan Lu, Yanqiao Zhu, Jieyu Zhang, Satyen Subramaniam, Arjun Loomba, Shichang Zhang, Yizhou Sun, Wei Wang [Paper]

  11. Spoken Language Understanding Evaluations for Home-Based Basic Math Learning
    Eda Okur, Saurav Sahay, Lama Nachman [Paper]

  12. llmstep: LLM proofstep suggestions in Lean
    Sean Welleck, Rahul Saha [Paper]

  13. SIRD: Symbolic Integration Rules Dataset
    Vaibhav Sharma, Abhinav Nagpal, Muhammed Fatih Balin [Paper]

  14. AI for Mathematics: A Cognitive Science Perspective
    Cedegao Zhang, Katherine Collins, Adrian Weller, Joshua Tenenbaum [Paper]

  15. CNN models’ sensitivity to numerosity concepts
    Neha Upadhyay, Sashank Varma [Paper]

  16. Teaching Arithmetic to Small Transformers
    Nayoung Lee, Kartik Sreenivasan, Jason Lee, Kangwook Lee, Dimitris Papailiopoulos [Paper]

  17. Reinforcement Learning in Control Theory: A New Approach to Mathematical Problem Solving
    Kala Agbo Bidi, Jean-Michel Coron, Amaury Hayat, Nathan Lichtlé [Paper]

  18. Teaching small transformers to rewrite ZX diagrams
    Francois Charton, Alexandre Krajenbrink, Konstantinos Meichanetzidis, Richie Yeung [Paper]

  19. SatLM: Satisfiability-Aided Language Models Using Declarative Prompting
    Xi Ye, Qiaochu Chen, Isil Dillig, Greg Durrett [Paper]

  20. OpenWebMath: An Open Dataset of High-Quality Mathematical Web Text
    Keiran Paster, Marco Dos Santos, Zhangir Azerbayev, Jimmy Ba [Paper]

  21. A Language-Agent Approach to Formal Theorem-Proving
    Amitayush Thakur, Yeming Wen, Swarat Chaudhuri [Paper]

  22. Lemur: Integrating Large Language Models in Automated Program Verification
    Haoze Wu, Clark Barrett, Nina Narodytska [Paper]

  23. Temperature-scaled large language models for Lean proofstep prediction
    Fabian Gloeckle, Baptiste Roziere, Amaury Hayat, Gabriel Synnaeve [Paper]

  24. Basic Arithmetic Properties in the Space of Language Model Prompts
    Mateusz Krubiński [Paper]

  25. Magnushammer: A Transformer-Based Approach to Premise Selection
    Maciej Mikuła, Szymon Antoniak, Szymon Tworkowski, Bartosz Piotrowski, Albert Jiang, Jin Peng Zhou, Christian Szegedy, Łukasz Kuciński, Piotr Miłoś, Yuhuai Wu [Paper]

  26. EchoPrompt: Instructing the Model to Rephrase Queries for Improved In-context Learning
    Raja Sekhar Reddy Mekala, Yasaman Razeghi, Sameer Singh [Paper]

  27. Learning Multi-Step Reasoning by Solving Arithmetic Tasks
    Tianduo Wang, Wei Lu [Paper]

  28. Towards Large Language Models as Copilots for Theorem Proving in Lean
    Peiyang Song, Kaiyu Yang, Anima Anandkumar [Paper]

  29. Plan, Verify and Switch: Integrated Reasoning with Diverse X-of-Thoughts
    Tengxiao Liu, Qipeng Guo, Yuqing Yang, Xiangkun Hu, Yue Zhang, Xipeng Qiu, Zheng Zhang [Paper]

  30. ARB: Advanced Reasoning Benchmark for Large Language Models
    Tomohiro Sawada, Daniel Paleka, Alexander Havrilla, Pranav Tadepalli, Paula Vidas, Alexander Kranias, John Nay, Kshitij Gupta, Aran Komatsuzaki [Paper]

  31. Learning the greatest divisor - Explainable predictions in transformers
    Francois Charton [Paper]

  32. Can We Count on Deep Learning: Exploring and Characterizing Combinatorial Structures Using Machine Learning
    Helen Jenne, Herman Chau, Davis Brown, Jackson Warley, Timothy Doster, Henry Kvinge [Paper]

  33. Discovering Lyapunov functions with transformers
    Alberto Alfarano, Francois Charton, Amaury Hayat [Paper]

  34. CHAMP: A Competition-Level Dataset for Fine-Grained Analyses of LLMs’ Mathematical Reasoning Capabilities
    Yujun Mao, Yoon Kim, Yilun Zhou [Paper]

  35. Vertical AI-driven Scientific Discovery
    Yexiang Xue [Paper]

  36. Solving Math Word Problems by Combining Language Models With Symbolic Solvers
    Joy He-Yueya, Gabriel Poesia, Rose Wang, Noah Goodman [Paper]

  37. Exploration with Principles for Diverse AI Supervision
    Hao Liu, Matei Zaharia, Pieter Abbeel [Paper]

  38. Solving Math Word Problems with Reexamination
    Yi Bin, Wenhao Shi, Yujuan Ding, Yang Yang, See-Kiong Ng [Paper]

  39. LLM vs ITP
    Simon Frieder, Martin Trimmel, Rashid Alawadhi, Klaus Gy [Paper]

  40. Probabilistic Abduction for Visual Abstract Reasoning via Learning Rules in Vector-symbolic Architectures
    Michael Hersche, Francesco di Stefano, Thomas Hofmann, Abu Sebastian, Abbas Rahimi [Paper]

  41. Finding Increasingly Large Extremal Graphs with AlphaZero and Tabu Search
    Abbas Mehrabian, Ankit Anand, Hyunjik Kim, Nicolas Sonnerat, Tudor Berariu, Matej Balog, Gheorghe Comanici, Andrew Lee, Anian Ruoss, Anna Bulanova, Daniel Toyama, Sam Blackwell, Bernardino Romera Paredes, Laurent Orseau, Petar Veličković, Anurag Murty Naredla, Joonkyung Lee, Adam Zsolt Wagner, Doina Precup [Paper]