Data science and machine learning is evolving from just focusing on predictive models, toward a more democratised, dynamic and data-centric discipline, said Gartner recently.
“This trend is now also fuelled by the fervour around generative AI,” Peter Krensky, Director Analyst at Gartner noted. “While potential risks are emerging, so too are the many new capabilities and use cases for data scientists and their organisations when it comes to data science and machine learning.”
According to Gartner, the top trends shaping the future of data science and machine learning include the followings.
Cloud data ecosystems
Data ecosystems are moving from self-contained software or blended deployments to full cloud-native solutions.
By 2024, Gartner expects 50% of new system deployments in the cloud will be based on a cohesive cloud data ecosystem rather than on manually integrated point solutions.
Gartner recommends organisations evaluate data ecosystems based on their ability to resolve distributed data challenges, as well as to access and integrate with data sources outside of their immediate environment.
Edge AI
Demand for Edge AI is growing to enable the processing of data at the point of creation at the edge, helping organisations to gain real-time insights, detect new patterns and meet stringent data privacy requirements.
Edge AI also helps organisations improve the development, orchestration, integration and deployment of AI.
Gartner predicts that more than 55% of all data analysis by deep neural networks will occur at the point of capture in an edge system by 2025, up from less than 10% in 2021.
Organisations should identify the applications, AI training and inferencing required to move to edge environments near IoT endpoints.
Responsible AI
Responsible AI makes AI a positive force, rather than a threat to society and to itself. It covers many aspects of making the right business and ethical choices when adopting AI that organisations often address independently, such as business and societal value, risk, trust, transparency and accountability.
Gartner predicts the concentration of pretrained AI models among 1% of AI vendors by 2025 will make responsible AI a societal concern.
Gartner recommends organisations adopt a risk-proportional approach to deliver AI value and take caution when applying solutions and models.
In addition, organisations need to seek assurances from vendors to ensure they are managing their risk and compliance obligations, protecting organizations from potential financial loss, legal action and reputational damage.
Data-centric AI
Data-centric AI represents a shift from a model and code-centric approach to being more data focused to build better AI systems. Solutions such as AI-specific data management, synthetic data and data labelling technologies, aim to solve many data challenges, including accessibility, volume, privacy, security, complexity and scope.
The use of generative AI to create synthetic data is one area that is rapidly growing, relieving the burden of obtaining real-world data so machine learning models can be trained effectively.
By 2024, Gartner predicts 60% of data for AI will be synthetic to simulate reality, future scenarios and derisk AI, up from 1% in 2021.
Accelerated AI investment
Investment in AI will continue to accelerate by organisations implementing solutions, as well as by industries looking to grow through AI technologies and AI-based businesses.
By the end of 2026, Gartner predicts that more than $10 billion will have been invested in AI startups that rely on foundation models – large AI models trained on huge amounts of data.
A recent Gartner poll of more than 2,500 executive leaders found that 45% reported that recent hype around ChatGPT prompted them to increase AI investments while 70% said their organisation is in investigation and exploration mode with generative AI and 19% are in pilot or production mode.