Verify that the audio matches the expected content, and validate the questions against the audio transcription
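
One way to picture the two checks is the minimal sketch below. It assumes a transcription of the audio is already available as plain text, and that each question comes with an expected answer phrase that should appear in that transcription. The function names (`normalize`, `transcript_similarity`, `matches_expected`, `answer_in_transcript`) and the 0.7 threshold are hypothetical choices made for illustration, not part of any existing tool.

```python
import re
from difflib import SequenceMatcher


def normalize(text: str) -> list[str]:
    """Lowercase the text and split it into word tokens, dropping punctuation."""
    return re.findall(r"[a-z0-9']+", text.lower())


def transcript_similarity(transcript: str, expected: str) -> float:
    """Return a word-level similarity ratio between 0.0 and 1.0."""
    return SequenceMatcher(None, normalize(transcript), normalize(expected)).ratio()


def matches_expected(transcript: str, expected: str, threshold: float = 0.7) -> bool:
    """Treat the audio as matching when similarity reaches the (assumed) threshold."""
    return transcript_similarity(transcript, expected) >= threshold


def answer_in_transcript(answer: str, transcript: str) -> bool:
    """Check that the answer's words occur as a contiguous run in the transcript."""
    answer_words = normalize(answer)
    transcript_words = normalize(transcript)
    if not answer_words:
        return False
    n = len(answer_words)
    return any(
        transcript_words[i:i + n] == answer_words
        for i in range(len(transcript_words) - n + 1)
    )


if __name__ == "__main__":
    transcript = "The train to Boston leaves at nine thirty."
    expected = "The train to Boston leaves at 9:30."

    # Check 1: does the transcribed audio match the expected content?
    print(matches_expected(transcript, expected))

    # Check 2: is each question's expected answer grounded in the transcription?
    print(answer_in_transcript("Boston", transcript))
    print(answer_in_transcript("Chicago", transcript))
```

Word-level comparison after normalization keeps the check tolerant of punctuation and casing differences. It does not handle spelled-out numbers or paraphrased answers, so the threshold and the matching rule would need tuning for real data.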