Invent, design, and implement efficient algorithms to optimize large language model (LLM) inference on DNN accelerators.
Must-have
Nice-to-have
Not specified