Training Computer Vision Models to Track Rented Assets and Triage Issues: An Interview with Charlotte Bax, Founder and CEO of Captur AI

Encord

By 

Encord

Published 

Jun 8, 2022

Training Computer Vision Models to Track Rented Assets and Triage Issues: An Interview with Charlotte Bax, Founder and CEO of Captur AI

How can AI help solve real-world operations problems for businesses and their on-the-ground employees?

That’s the question that drove Charlotte Bax to start Captur

Founded in 2020, Captur uses computer vision to capture images of, and detect damage to, physical assets. Over the past two years, Captur has worked with major global players in micro mobility and last-mile delivery including Tier, Zapp, and Buzzbike, helping these companies access real-time data about their operations

Like most start-up founders, Bax wears many hats. Every day, she is responsible for making a variety of decisions. One day, in early 2021, Bax made the decision to use Encord to help build out Captur’s data pipeline infrastructure.

“When you’re running a startup, anything that’s core to your product, you build it. In any case where you can, you buy tools so that you can move faster,” says Bax. “Machine learning is core to our technology, but the reason for buying Encord was that it could help us improve the quality of the data going into our models and that would help us speed up our go-to-market measurably.”  

We recently caught up with Bax to hear more about the power of computer vision to assess the condition of remote assets, her experience using Encord, and her mission to bring applied AI to long-tail industries like operations.

The following interview has been edited for clarity and length.

Encord: Could you talk a little about the problem Captur is trying to solve?

Charlotte Bax: The problem we’re trying to solve is basically: how can companies track assets and resolve issues remotely? Ultimately making scaling logistics easier and more efficient. 

Most people are familiar with the experience of renting a car. Before you leave the parking lot, you walk around the vehicle with a representative from the rental car company. Together, you mark down any visible signs of damage. This inspection usually only lasts a few minutes. But what happens if something isn’t caught by either party? Who’s to say if the scratch on the passenger’s side door was there before the driver rented the car? It’s a lose- lose situation for both parties. Computer vision can help companies and users track that type of damage, removing the likelihood of human error.

That example is just the tip of the iceberg. Property, vehicles, construction equipment– there’s multiple different verticals that could benefit from computer vision providing real-time operations information about physical assets. As the on-demand business model grows, more and more companies will look to computer vision technologies because on-demand businesses–the ones where customers rent items on-demand–need new operating tools, ones that can enable them to efficiently scale operations that manage rented and shared physical items.

It might surprise you to learn that most companies with physical operations still rely on hand-built spreadsheets for asset tracking. This method lends itself to error and disputes. Also, spreadsheets don’t provide any real-time data or visibility about operations, so companies don’t have access to the data needed to develop basic metrics for making decisions about their business. Computer vision can not only help companies track the condition of their assets remotely, but it can also provide them with the data needed to automate their frontline decision making using AI. 

Encord: Let’s talk about some of Captur’s customers and how they’re using this technology.

Bax: We’ve been working a lot with micro mobility companies, such as e-scooters and e-bikes. We’re starting to move into some other asset classes, such as larger vehicles like vans and cargo bikes, and eventually, we aim to offer our service for any mobile asset.

Our product is API first, so for B2B customers, we’re embedded in their apps. With e-scooters, for instance, riders take a photo of the scooter at the  end of their trip and upload those images into the company’s app. 

Our technology automatically verifies these photos, and ultimately, Captur’s models provide an output in the form of a suggested action  based on that visual information. For instance, ‘Yes that’s good parking’, or ‘no that’s bad parking’. ‘Yes, the scooter is in good condition’, or ‘no it’s been damaged’. The goal is to provide the end user with real-time feedback on their device and prevent safety issues instantly. 

Encord: How valuable are data sets and training data for the future of your business?

Bax: Incredibly valuable. Our approach as a business has always been a data-centric approach which lets us build models in a way that’s transparent and collaborative. Think of it as building personalised machine learning for companies. We build up data sets that are very specialised– certain types of assets that have certain types of issues, damages, and conditions. These use cases are very specific. We’re able to do that in a scalable way across multiple different customers because we have a solid data review process and pipeline, which has been critical for both building models and providing our customers with transparency and confidence. 

Encord: Why and how do you use Encord?

Bax:  When we came across Encord, we were thinking about how we were going to scale our data annotation pipeline and data quality assurance. We wanted to be checking subsets of our models’ decisions at any given time to evaluate their performance.

That’s something we could have spent a solid nine months alone building in house, but we really liked what Encord could do. We have essentially built out a critical part of our own pipeline infrastructure in tandem with them. Their team has been real product development partners to us. 

We use Encord as a part of our training and evaluation pipeline. Both annotating new datasets and evaluating the results from live models. We use expert-in-the-loop feedback to improve the model’s performance as quickly as possible. 

In the beginning, we used a few different open source tools to annotate images for training models, but once we got the scale of moving beyond research projects and really needing a production-ready system, we moved towards using Encord because the additional features in the platform– like having a robust system to do quality control and a method for measuring the confidence scores of how much labellers agree– were ideal for scaling. 

We needed a platform that enabled various different types of labelling because we have different types of machine learning challenges. Our goal is to diagnose the condition of an object, which requires several steps that include individual issue detection– e.g. damage to a scooter frame– and the condition of where something is placed likewhere a scooter is parked. 

We have six people who actively work on Encord daily, and the Encord team have been great partners not just in helping us solve labelling challenges within the platform but also in helping us think about where this platform sits in the larger infrastructure that we’re building.

Overall, Encord’s vision for automating data labelling and training data management really aligns with our mission of bringing computer vision to this long-tail of industries like operations. Many industries just haven't been able to take advantage of AI or computer vision automation because the data set doesn't exist or the tools that enable them to do so don’t exist. Encord is really the start of the tool that's going to enable a ton of startups, like us, who are focused on bringing applied AI to every industry, not just healthcare and autonomous vehicles.

Related Posts