Speech

What's a Transformer? Understanding the Innovation That Changed Artificial Intelligence

Collin Flynn

April 20, 2024

About the talk

"Transformer" is a very generic name for a specific machine learning architecture, first published in a paper titled "Attention Is All You Need". Transformers were conceived to handle text translation, but are surprisingly effective in other (non-text) domains. What was the key innovation that set Transformers apart from previous methods? In this talk we'll look at embeddings, gradient descent, loss functions, and how Transformers learn to pick up on conversational context using matrix multiplication. We'll also take a look at a newer architecture called "Mamba", and how it improves upon some of the weaknesses of Transformers.
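
As a rough illustration of the "context via matrix multiplication" idea, here is a minimal sketch of scaled dot-product attention, the operation named in the paper's title: softmax(QK^T / sqrt(d_k)) V. It is not taken from the talk. The toy embeddings and the helper names (`softmax`, `matmul`, `transpose`, `attention`) are assumptions made for this example, and the learned projection matrices that a real Transformer uses to produce queries, keys, and values are left out for brevity.

```python
import math


def softmax(row):
    """Turn a list of scores into positive weights that sum to 1."""
    peak = max(row)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in row]
    total = sum(exps)
    return [e / total for e in exps]


def transpose(m):
    """Swap rows and columns of a matrix given as a list of rows."""
    return [list(col) for col in zip(*m)]


def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)]
            for row in a]


def attention(queries, keys, values):
    """Scaled dot-product attention: softmax(Q K^T / sqrt(d_k)) V."""
    d_k = len(keys[0])
    # One score per (query token, key token) pair.
    scores = matmul(queries, transpose(keys))
    # Scale, then normalize each row into attention weights.
    weights = [softmax([s / math.sqrt(d_k) for s in row]) for row in scores]
    # Each output row is a weighted mix of the value vectors.
    return matmul(weights, values), weights


# Three toy token embeddings (hypothetical numbers, 2 dimensions each).
# Assumption: the embeddings are used directly as Q, K, and V; a real
# Transformer would first multiply them by learned weight matrices.
tokens = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
output, weights = attention(tokens, tokens, tokens)
```

Each row of `weights` says how much one token draws on every other token, so each row of `output` is a context-aware blend of the inputs. Training with a loss function and gradient descent is what would adjust the omitted projection matrices so that these weights become useful.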

Presented by

Collin Flynn

Principal Software Engineer


About Livefont

Our team creates beautiful mobile apps for people in motion. We work every day with Fortune 500 companies and startups alike to build custom software for phones, tablets, mobile payment wallets, and wearables like watches and glasses. We care deeply about quality, and we love what we do.