2025 Self-Correction Bench: Revealing and Addressing the Self-Correction Blind Spot in LLMs Ken Tsui 2025 Website NumSeqBench: Benchmarking Inductive Reasoning in Language Models via Number Sequences Ken Tsui 2025 Website 2024 Aurora-M: Open Source Continual Pre-training for Multilingual Language and Code Taishi Nakamura, Mayank Mishra, Simone Tedeschi, and 42 more authors 2024 Website