-
1
VIIDA and InViDe: computational approaches for generating and evaluating inclusive image paragraphs for the visually impaired
Published 2024“…</p> <p>We reviewed existing methods and developed VIIDA by integrating a multimodal Visual Question Answering model with Natural Language Processing (NLP) filters. A scene graph-based algorithm was then applied to structure final paragraphs. …”
-
2
Model output of different pairs of parameters.
Published 2025“…Successful segmentation models in computer vision, including graph-based algorithms and vision transformer, leverage similarity computations across all elements in an image, suggest that effective similarity-based grouping should rely on a global computational process. …”