Stream-Based Active Learning for Regression with Dynamic Feature Selection

Davide Cacciarelli, John Solve Tyssedal, Murat Kulahci

Research output: Chapter in Book/Report/Conference proceedingArticle in proceedingsResearchpeer-review

Abstract

In the era of big data, companies are increasingly driven to amass vast amounts of data, particularly in process industries where advanced sensor technologies are prevalent. However, obtaining accurate labels or product information through quality inspections can be prohibitively expensive. Active learning emerges as a promising approach to optimize data sampling by prioritizing the most informative data points. Nevertheless, active learning strategies heavily rely on predictive models that are iteratively updated. Aligning with the principles of data-centric AI, this study highlights the detrimental effects of passively incorporating all available process variables into a predictive model for guiding data collection. Specifically, in real-time sampling strategies based on online active learning, the inclusion of irrelevant features significantly hampers the efficiency of the learning process.
Original languageEnglish
Title of host publicationProceedings of 5th International Conference on Transdisciplinary AI (TransAI)
PublisherIEEE
Publication date2023
Pages243-248
ISBN (Print)979-8-3503-5802-5
DOIs
Publication statusPublished - 2023
Event2023 Fifth International Conference on Transdisciplinary AI - Hills Hotel, Laguna Hills, United States
Duration: 25 Sept 202327 Sept 2023

Conference

Conference2023 Fifth International Conference on Transdisciplinary AI
LocationHills Hotel
Country/TerritoryUnited States
CityLaguna Hills
Period25/09/202327/09/2023

Keywords

  • Data-centric AI
  • Active learning
  • Unlabeled data
  • Data streams
  • Feature selection
  • Design of experiments

Fingerprint

Dive into the research topics of 'Stream-Based Active Learning for Regression with Dynamic Feature Selection'. Together they form a unique fingerprint.

Cite this