bkitano19 · 2026-01-16T18:08:55 1768586935

You can use voice prompting; it's supported on ElevenLabs and Hume.

bkitano19 · 2025-10-21T13:38:06 1761053886

Awesome post!

krackers · 2025-10-21T22:00:42 1761084042

Indeed, the title undersells it, and I'm glad I didn't skip over it. The article is basically an information-dense but approachable summary of audio generation.

bkitano19 · 2025-08-20T04:41:31 1755664891

you might like https://en.wikipedia.org/wiki/Noether%27s_theorem

bkitano19 · 2025-06-12T01:10:57 1749690657

https://huggingface.co/spaces/nvidia/parakeet-tdt-0.6b-v2

bkitano19 · 2025-02-06T16:22:42 1738858962

Related work:

Interpreting Modular Addition in MLPs https://www.lesswrong.com/posts/cbDEjnRheYn38Dpc5/interpreti...

Paper Replication Walkthrough: Reverse-Engineering Modular Addition https://www.neelnanda.io/mechanistic-interpretability/modula...

joelburget · 2025-02-06T21:41:56 1738878116

And more recently, [Language Models Use Trigonometry to Do Addition](https://arxiv.org/abs/2502.00873)

bkitano19 · on Oct 15, 2024

hume.ai specializes in expressive prosody for TTS (disclaimer - I work here)

bkitano19 · on Aug 27, 2024

Time to first token is just as important to know for many use cases, yet people rarely report it.

Gcam · on Aug 27, 2024

See here for our TTFT metric benchmarks: https://artificialanalysis.ai/models/llama-3-1-instruct-70b/...

bkitano19 · on Aug 9, 2024

this is nuts

cpeterson42 · on Aug 9, 2024

We think so too, big things coming :)

goku-goku · on Aug 9, 2024

www.juicelabs.co

bkitano19 · on July 10, 2024

+1, had the fortune to work with him at a previous startup and meet up in person. Our convo very much broadened my perspective on engineering as a career and a craft, and I'm always excited to see what he's working on. Good luck Simon!

bkitano19 · on July 8, 2024

https://transformer-circuits.pub/2022/in-context-learning-an...

there is a lot of evidence to suggest that they are performing induction