Language-Guided Geospatial Algae Bloom Reasoning and Segmentation
Climate change is intensifying the occurrence of harmful algal blooms (HAB), particularly cyanobacteria, which threaten aquatic ecosystems and human health through oxygen depletion, toxin release, and disruption of marine biodiversity. Traditional monitoring approaches, such as manual water sampling, remain labor-intensive and limited in spatial and temporal coverage. Recent advances in vision-language models (VLMs) for remote sensing have shown potential for scalable AI-driven solutions, yet challenges remain in reasoning over imagery and quantifying bloom severity.
In this work, we introduce ALGOS (ALGae Observation and Segmentation), a segmentation-and-reasoning system for HAB monitoring that combines remote sensing image understanding with severity estimation. Our approach integrates GeoSAM-assisted human evaluation for high-quality segmentation mask curation and fine-tunes vision language models on severity prediction using the Cyanobacteria Aggregated Manual Labels (CAML) from NASA. Experiments demonstrate that ALGOS achieves robust performance on both segmentation and severity-level estimation, paving the way toward practical and automated cyanobacterial monitoring systems.
ALGOS is a unified vision-language framework that bridges reasoning segmentation with HAB severity assessment in satellite imagery.
ALGOS achieves strong performance across both segmentation and severity prediction tasks:
@article{hsieh2025segthehab,
title = {Seg the HAB: Language-Guided Geospatial Algae Bloom Reasoning and Segmentation},
author = {Patterson Hsieh and Jerry Yeh and Mao-Chi He and Wen-Han Hsieh and Elvis Hsieh},
journal = {NeurIPS 2025 Workshop on Tackling Climate Change with Machine Learning},
year = {2025},
url = {https://arxiv.org/abs/2510.18751}
}