Segmentation, diarization and speech transcription by