A Unified Multimodal Data Quality Classifier for generating quality scores for both image-text caption data and interleaved document data