Antonella Abbate • 17 June 2026

Who Is Storing the World's Vehicle Data?

For most of the motoring public, data collection begins and ends with speed cameras, toll roads and vehicle registration records. The reality, however, is far more complex. Modern vehicles have evolved into highly sophisticated computers on wheels, continuously generating information about their operation, location, condition and performance. As vehicle connectivity expands, so too does the volume of data being collected, transmitted and stored around the world.

The collection of vehicle-generated data is not new, but Australia's formal consideration of how this information could be utilised accelerated around 2019 when the National Transport Commission commenced significant work examining the future use, ownership and governance of vehicle-generated data. Governments recognised that connected vehicles would become an increasingly valuable source of information for road planning, infrastructure investment, congestion management and safety improvements.


The benefits are obvious. By analysing traffic flows, travel patterns and vehicle movements, road authorities can identify congestion points, improve traffic management systems and better target future infrastructure spending. Vehicle data can also assist in monitoring road conditions, evaluating the effectiveness of road upgrades and understanding how motorists interact with transport networks.


What is less obvious is the sheer volume of information involved.


To understand the scale, it helps to look at the numbers. A single petabyte of data is equivalent to approximately one thousand terabytes. In practical terms, one petabyte could store hundreds of millions of photographs or billions of pages of documents. Industry estimates suggest that connected vehicle networks can generate tens of petabytes of telemetry data each year. When the reporting section of Australia's more than 21 million registered vehicles are considered alongside tolling systems, traffic cameras, heavy vehicle telematics, infrastructure sensors and traffic management networks, it becomes clear that the nation's transport sector is generating information on a scale that would have been unimaginable only a decade ago.


Some transport and technology analysts estimate that Australia's broader transport ecosystem may already be producing data volumes measured in the tens of petabytes annually. If that figure is even remotely accurate, it raises an obvious question: if Australia is producing this amount of information, how much data is being generated globally?

The answer is staggering.

With more than 1.4 billion vehicles operating worldwide and millions of new connected vehicles entering service every year, the automotive sector has become one of the world's largest sources of machine-generated data. Industry experts now measure global automotive data production not in petabytes but in exabytes. To place that in perspective, one exabyte equals one thousand petabytes. The numbers quickly become almost impossible to comprehend.


The next question is perhaps even more important: where is all of this information being stored?


Contrary to popular belief, much of the world's vehicle data is not housed within government facilities or automotive company headquarters. Increasingly, it resides within vast cloud computing networks operated by a relatively small number of technology companies. The largest of these include Amazon Web Services (AWS), Microsoft Azure, Google Cloud, Alibaba Cloud and Oracle.


Amazon Web Services is currently regarded as the world's largest cloud computing provider and has become deeply embedded within the global automotive industry. Many manufacturers utilise AWS infrastructure to support connected vehicle services, over-the-air software updates, remote diagnostics and vehicle data analytics. In simple terms, AWS provides the digital warehouse while manufacturers retain ownership and control of the information stored within it.


This issue received attention within automotive circles when representatives associated with BYD New Zealand confirmed that elements of their connected vehicle ecosystem operate through Amazon's cloud infrastructure. BYD is by no means unique. Numerous vehicle manufacturers around the world rely upon the same cloud providers to support their connected vehicle platforms.


Importantly, the fact that information is stored within cloud networks does not necessarily mean that governments or technology companies know who is driving every vehicle. In many cases, data used for traffic analysis, infrastructure planning and vehicle performance monitoring is anonymised or aggregated. The vehicle itself may be identifiable through a digital identifier while the personal identity of the driver remains separated from the dataset.


However, privacy advocates continue to raise legitimate questions about what may be possible in the future. As artificial intelligence systems become increasingly capable and separate databases become more interconnected, concerns remain regarding the potential for information from multiple sources to be correlated. Vehicle location data, toll road records, registration information, mobile device data and camera systems could theoretically be combined to create a highly detailed picture of an individual's movements.


Whether such capabilities are currently being exercised is almost secondary to the broader discussion. The real debate centres on who ultimately owns vehicle-generated data, who has access to it, how long it is retained and what safeguards exist to protect motorists' privacy.

The automotive industry is rapidly moving towards software-defined vehicles where diagnostics, maintenance, safety systems, navigation, charging management and future autonomous driving functions all depend upon the continuous collection and analysis of data. In many respects, the modern vehicle is no longer simply a means of transportation. It has become a mobile data generator connected to a global digital infrastructure.


As governments, manufacturers and technology companies continue to expand their use of vehicle-generated information, the challenge for regulators will be ensuring that innovation and privacy remain in balance. What is certain is that the volume of automotive data being generated today is only a fraction of what will be produced tomorrow.


The connected car revolution is no longer coming. It has already arrived, and somewhere within some of the world's largest data centres, the digital footprint of modern motoring continues to grow every second.

Get to know IAME Member: 
Ian Barker – Five Decades of Automotive Excellence
by Antonella Abbate 15 June 2026
Get to know IAME Member: Ian Barker – Five Decades of Automotive Excellence
by Antonella Abbate 12 June 2026
Upcoming AIS Online outage AIS Online will be unavailable on Sunday 14 June from 1am to 1pm (if this date and time needs to change, we will notify you via AIS Online). AIS Online functionality There is no change to functionality; just some minor updates and you’ll simply log in via a new URL. When users attempt to access the old myrta URL You will be redirected to an information page explaining the update. The page will provide the new Transport URL to access AIS Online. Follow this link and log in as normal. What you need to do Update any bookmarks, shortcuts, or saved links that still point to the old myrta URL. This will ensure a smooth transition and help prevent access issues in the future. This update is part of ongoing efforts to improve security and consistency across Transport systems.
by Antonella Abbate 12 June 2026
We want to celebrate our members and share their journeys throughout their careers.