Where Does Performance Go When Serving an LLM
A deep dive on where the cost lies at when serving llm models.
A deep dive on where the cost lies at when serving llm models.
Introduction Portfolio optimization has come a long way since Harry Markowitz introduced his groundbreaking mean-variance model in 1952.His model gave investors something powerful: a way to mathem...
As backend engineers, we often obsess over read performance. Caching strategies, query plans, indexes, pagination all the usual suspects. But every once in a while, a system reminds you of an uncom...
Introduction I stumbled across last month’s TidyTuesday analytical challenge featuring Netflix data, and curiosity got the better of me what exactly are we watching, and what patterns lie beneath ...
The Binomial Model Lets begin with the one Period Model: Model Description Time is denoted by 𝑡 and by definition we have two points in time $t=0(today)$, $t=1(tomorrow)$. In the model we have ...
Introduction This post was originally published on the Innova official blog. we are back at it with Linear Algebra,In of data science, engineering, and applied mathematics,least squares problems ...
Introduction This post was originally published on the Innova official blog. Formal specifications provide a clear framework for defining software behavior, yet connecting these specifications to ...
Introduction Most of my write ups emanate from stuff i do on a daily and curiosity leads me to explore.I was working on an implementation of multiple polynomial regression recently, I encountered ...
This post was originally published on the Innova official blog. Introduction. While reading Anthony Shaw’s CPython Internals to initially to understand concurrency and parallelism from the perspe...
From Raw Text to Intelligent Querying Introduction. I was exploring ways to implement memory in an LLM-powered REST API integration. The assistant’s role was to interpret a grid of numbers — most...