Deep Symbolic Learning for Histogram-Valued Regression Data

  • Ilsuk Kang
  • , Donghwa Kim
  • , Hosik Choi
  • , Young Joo Yoon
  • , Cheolwoo Park

Research output: Contribution to journalArticlepeer-review

Abstract

This paper proposes the Deep Symbolic Learning (DSL) model, a deep learning-based framework for robust regression, specifically designed when both the response and predictors are histogram-valued variables. DSL utilizes cumulative distribution functions (CDFs) of covariate histograms within a one-dimensional convolutional neural network (1D-CNN) to transform the conditional density estimation problem into a multi-class classification task, optimized using the joint binary cross-entropy (JBCE) loss function. Extensive simulations and real-world applications, including air quality, traffic volume, and climate data, demonstrate that the DSL model outperforms existing methods across three key evaluation metrics: CDF distance, empirical coverage of the 90% prediction interval, and average quantile loss. This work contributes to the field of symbolic data analysis and conditional density estimation.

Original languageEnglish
Article numbere70033
JournalStatistical Analysis and Data Mining
Volume18
Issue number4
DOIs
StatePublished - Aug 2025

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 13 - Climate Action
    SDG 13 Climate Action

Keywords

  • conditional density estimation
  • convolutional neural networks
  • deep symbolic learning
  • histogram-valued data
  • JBCE loss

Fingerprint

Dive into the research topics of 'Deep Symbolic Learning for Histogram-Valued Regression Data'. Together they form a unique fingerprint.

Cite this