Puerto, Haritz; Chubakov, Tilek; Zhu, Xiaodan; Tayyar Madabushi, Harish; Gurevych, Iryna. Phi 1.5 Model checkpoints for Fine-Tuning with Divergent Chains of Thought Boosts Reasoning Through Self-Correction in Language Models. (2024-06). CC BY-SA 3.0. LLM, large language model, NLP, chain of thought, cot, 4.43-04 Künstliche Intelligenz und Maschinelle Lernverfahren, 4.43-05 Bild- und Sprachverarbeitung, Computergraphik und Visualisierung, Human Computer Interaction, Ubiquitous und Wearable Computing, 004. Technical University of Darmstadt. https://tudatalib.ulb.tu-darmstadt.de/handle/tudatalib/4270