We introduce OneCAT, a unified multimodal model that seamlessly integrates understanding, generation, and editing within a novel, pure decoder-only transformer architecture. Our framework uniquely ...
Abstract: Advanced diffusion MRI (dMRI) models such as diffusion kurtosis imaging (DKI) and neurite orientation dispersion and density imaging (NODDI) provide rich microstructural information but ...
This project proposes a generative framework integrating diffusion transformers with a novel algebraic language representation, encoding 3D shell metamaterial geometries as mathematical sentences for ...
Abstract: Connectionist temporal classification (CTC) is one of the predominant schemes for end-to-end speech recognition because of its simplicity, efficiency and reliability. However, as a sequence ...
Methods: We evaluated 17 encoder and decoder models using J-CaseMap, a database of approximately 20,000 Japanese case reports annotated with clinical concepts. Performance was primarily assessed using ...