Things I find online and am looking into:
AI in Microcontrollers (Benchmarks)
LLM Token Rate Optimization
Calculating GPU Memory for Serving LLMs