Research
I'm interested in understanding how large language models can learn from new data, exploring open-source models and methods, and applying these models to other domains (e.g., crypto).
|
|
Agent Instructs Large Language Models to be General Zero-Shot Reasoners
Nicholas Crispino, Kyle Montgomery, Fankun Zeng, Dawn Song, Chenguang Wang
Forty-first International Conference on Machine Learning, 2024
arxiv /
code /
Zero-shot AgentInstruct uses an agent to generate dataset-specific instructions to improve the zero-shot performance of instruction-following large language models.
|
Other Projects
These include coursework, side projects and unpublished research work.
|
|