Notes

Notes I write to think more clearly, mostly on systems and performance. RSS · Atom.

Vision Language Action (VLA) Models

A look at how VLA models bridge perception and physical movement, from RT-2 to pi0.

· robotics, deep learning, computer vision · 8 min read