New papers: Be my eyes on extending modality through multi-agent collaboration and omni-modal guardrail!