The bitter lesson in ML is that we must make methods that learn how to discover, not contain what we have discovered

source: comma ai

A couple of years ago, the idea of self-driving cars was primarily focused on the use of multiple sensors like LIDAR, radar, ultrasonic sensors, and cameras. These sensors were mounted on custom-made vehicles or heavily modified production cars to achieve full autonomous functionality. The combination of these sensors enabled the vehicles to detect obstacles, map their surroundings in real time, and make complex driving decisions.

However, recent advancements in the field have shifted towards refining sensor fusion technologies and optimizing AI algorithms to reduce reliance on expensive hardware like LIDAR. Companies like Tesla, for example, have been advocating for vision-based approaches that primarily rely on cameras and neural networks to achieve similar outcomes. Comma.ai on the other hand does something I find very interesting, using End-to-End Machine learning to solve self-driving cars.

Comma.ai’s heavy reliance on machine learning means its systems are designed to learn and adapt from real-world driving behavior. By collecting data from users’ vehicles, the company continuously improves its models, allowing for:

  • Better edge-case handling (uncommon scenarios).

  • Adaptive driving styles that match local norms.

  • A gradual progression toward higher levels of autonomy.

The task itself is extremely difficult, and I like the bold step they are taking to solve it.

The company was founded in 2017, by George Hotz, it is opensource, the idea here was to Build a superhuman driving agent by using a very concise plan

  • Get the data

  • Own the Network

  • Win self-driving cars

While the vision of Comma.ai is bold and ambitious, it’s also grounded in practicality. The company’s three-step plan — “Get the data, Own the network, Win self-driving cars” — encapsulates their approach to solving one of the most challenging problems in AI and robotics, yes, the company is a robotics company self-driving is just one of the many problems they will solve.

  1. Get the Data
  • Comma.ai recognizes that high-quality data is the backbone of any machine learning-based system. By enabling everyday drivers to use their hardware kits and contribute driving data through platforms like openpilot, they’ve created a scalable and cost-effective way to train their neural networks.

  • Unlike larger companies that rely on custom-built autonomous vehicles to collect data, Comma.ai leverages real-world driving scenarios from a wide array of vehicles and drivers, making their dataset diverse and adaptable.

2. Own the Network

  • Directly quoting from their blog, “Ownership of the network, and being able to push code to real users, is how we are going to win. We are closing the reinforcement learning loop on a grand scale, and this is what will push our model to superhuman.”

3. Win Self-Driving Cars

  • Comma.ai’s ultimate goal is to create a superhuman driving agent — a system that not only matches but surpasses human driving capabilities, probably level 5 autonomy.

  • By focusing on end-to-end machine learning, where raw sensor inputs are directly mapped to driving actions, the company aims to bypass the complexities of traditional rule-based systems and unlock a more fluid, adaptable driving experience.

Challenges Along the Way Link to heading

From my perspective, the challenges that comma.ai would face on the road to fully autonomous vehicles are:

  • Edge Cases: Handling rare or extreme scenarios, such as unpredictable pedestrian behavior or harsh weather conditions, remains a major hurdle, although they claimed that the models can generalize by geolocation.

  • Regulation: Convincing regulators to approve autonomous systems is as important as building the technology itself, these seems not to be an issue, hopefully, when the model becomes close to perfect.

  • Market Competition: Larger companies like Tesla, Waymo, and Cruise have significantly more resources and funding, creating fierce competition.

Despite these challenges, Comma.ai’s lightweight, data-driven approach allow them to carve out a unique niche in the industry. By focusing on retrofitting existing cars rather than building fleets from scratch, they position themselves as a company democratizing access to advanced driver-assistance systems (ADAS).

Broader Implications Link to heading

The work Comma.ai is doing raises an important question: Could smaller, nimble companies like this ultimately outperform larger players in certain areas of autonomous driving? While companies like Tesla aim for full self-driving vehicles integrated with their hardware, Comma.ai takes a more modular approach that empowers users to bring autonomy to vehicles they already own, which for me is a better way to it — Directly from their blog, “Building a car is stupid. The current cars are perfectly good at what they do, they just need to, well, drive themselves.”

A Unique Space in the Self-Driving Ecosystem Link to heading

Comma.ai’s approach of retrofitting existing cars places it in a unique position within the self-driving car ecosystem. Unlike companies like Waymo or Cruise, which focus on purpose-built autonomous fleets, or Tesla, which integrates autonomy into its proprietary vehicles, Comma.ai empowers everyday car owners to experience advanced driver assistance at a fraction of the cost.

This modular, adaptable strategy highlights the company’s philosophy: self-driving technology should be accessible, scalable, and affordable. By allowing users to convert their vehicles into semi-autonomous systems, Comma.ai effectively lowers the barrier to entry, democratizing innovation in an industry dominated by big-budget projects.

Open-Source Philosophy: A Game-Changer Link to heading

The open-source nature of Comma.ai’s flagship software, openpilot, has become a cornerstone of its success. This philosophy not only fosters transparency but also unlocks the potential for:

  1. Global Collaboration: Developers from around the world can contribute to refining the platform, addressing bugs, and building upon existing features.

  2. Rapid Innovation: With a growing community, the pace of development accelerates, allowing the company to iterate and improve faster than closed-system competitors.

  3. Customizability: Enthusiasts and developers can adapt openpilot to suit specific needs or integrate it into vehicles beyond those officially supported.

Vision Beyond Hardware Link to heading

Comma.ai’s broader vision goes beyond simply selling hardware kits. By creating an ecosystem where data, software, and community-driven innovation converge, the company is positioning itself as a pioneer in self-driving intelligence.

While hardware such as the Comma Three kit is critical for deployment, the company’s true strength lies in its ability to gather driving data at scale. Every user contributes to the refinement of their neural networks, making the system smarter over time.

The Importance of End-to-End Machine Learning Link to heading

End-to-end machine learning is one of Comma.ai’s most intriguing innovations. Unlike traditional self-driving systems that rely on modular pipelines (where perception, decision-making, and control are handled separately), end-to-end learning processes raw sensor inputs directly into driving actions.

This approach has several advantages:

  1. Simplicity: Reducing the complexity of traditional modular systems makes the codebase leaner and easier to maintain.

  2. Adaptability: End-to-end systems learn holistically, enabling them to handle complex, real-world driving scenarios that modular systems might struggle to address.

  3. Scalability: As the system is exposed to more data, it continuously improves without requiring manual tuning of individual modules.

However, this approach is not without its challenges. Training an end-to-end system to reliably handle all driving scenarios is immensely complex and requires vast amounts of data and computational resources. Yet, this bold step shows Comma.ai’s ambition to push the boundaries of what machine learning can achieve in self-driving technology.

The Future of Comma.ai Link to heading

While Comma.ai’s primary focus has been self-driving cars, the principles and methodologies they employ — such as end-to-end machine learning and open-source collaboration — are not limited to autonomous vehicles. These strategies can be applied to tackle broader robotics challenges, enabling the company to expand into other areas of automation and artificial intelligence.

End-to-End Machine Learning for Robotics Link to heading

Comma.ai’s reliance on end-to-end machine learning has transformative potential across the robotics field. Many traditional robotics systems rely on modular pipelines, similar to the older approaches in self-driving cars, where perception, planning, and control are handled separately. By contrast, Comma.ai’s approach streamlines these processes into a unified learning system.

This method could address key challenges in robotics, such as:

  • Complex Manipulation: Teaching robotic arms to handle intricate tasks in manufacturing or surgery.

  • Dynamic Environments: Training robots to navigate and adapt to unpredictable environments, like warehouses, disaster zones, or even homes.

  • Generalization: Creating systems that can learn and adapt to entirely new tasks without extensive reprogramming.

The Vision: Robotics as a Learning Problem Link to heading

George Hotz and the Comma.ai team view robotics not as a hardware problem, but as a learning problem. Their philosophy is that with the right data, network architecture, and optimization strategies, any robotic system can achieve superhuman performance. Just as they aim to build a superhuman driving agent, they believe the same principles can be applied to build superhuman robotic agents across industries.

Key components of this approach include:

  1. Data Collection at Scale: Just as openpilot collects driving data from users, other robotics systems could be deployed in real-world environments to continuously gather data.

  2. Iterative Improvement: Using community contributions and real-world feedback to train and refine models, similar to the open-source model of openpilot.

  3. Hardware-Agnostic Solutions: Developing software that works across a variety of robotic platforms, allowing for modular and scalable deployment.

Comma.ai’s principles — data-driven improvement, end-to-end learning, and community collaboration — position it as a potential leader in not just autonomous driving, but the broader robotics revolution. By leveraging their open-source philosophy and scaling their machine learning expertise, they could create solutions that transcend industries, bringing intelligent, adaptable robots into our everyday lives.

Their success in self-driving cars serves as a proof of concept: with the right approach, even the most complex problems in robotics can be tackled effectively, affordably, and at scale. The boldness of their vision — combined with their innovative strategies — suggests that the journey of Comma.ai is only just beginning.

If you like blog post like this, consider subscribing to The Nerd Stack