Toward Using Audio for Matching Transcoded Content

Metadata

Publisher: SMPTE — White Plains, NY, USA
Doc Type: Journal Article
Content Type: Original Research
Abbreviated Title: SMPTE Mot. Imag. J
Volume: 123, No. 1, pp. 14–19
Abstract: With the advent of multiple screens for viewing media, transcoding is becoming a key component of content delivery ecosystems. But transcoding implies that copies and versions of the same content can proliferate across various storage devices. It also means that keeping track of content becomes a major problem from both copyright and recording/indexing perspectives. Video-based techniques for content indexing, where the aim is to extract robust signatures from video, have emerged as a major area of research. On the other hand, audio-based techniques have received less focus, but audio could provide robust signatures for indexing media while it undergoes transformations. This paper presents an investigation of audio signatures under typical transcoding operations. Specifically, mel-frequency cepstral coefficients (MFCCs) are examined as a signature, which has been widely used in audio recognition systems. Initial results indicate that MFCCs are quite robust.
Publication Date: 2014-01-01
DOI: 10.5594/j18367XY
ISSN: Print: 1545-0279
Link: https://doi.org/10.5594/j18367XY
Author(s): Dinkar Bhat
Copyright: © 2014 Society of Motion Picture and Television Engineers, Inc.

Bibliographic Reference(s)

1. Ahmad I. , Wei X. , Sun Yu , “Video Transcoding: An Overview of Various Techniques and Research Issues,” IEEE Trans. Multimedia , 7 : 793 – 804 , 2007 . EXTERNAL
2. Hampapur A. , Hyun Ki-Ho , and Bolle R. , “Comparison of Sequence Matching Techniques for Video Copy Detection,” Proc. IEEE International Conference on Multimedia and Expo (ICME‘01) , pp . 737 – 740 , 2001 . EXTERNAL
3. Xie R. , Ding G. , Wang J. , “A New Fingerprint Sequences Matching Algorithm for Content-Based Copy Detection,” Proc. Fifth International Conference on Information Assurance and Security , 2009 . EXTERNAL
4. Mitrovic D. , Zeppelzauer M. , and Breiteneder C. , “Features for Content-Based Audio Retrieval,” Adv. Comput. , 78 : 71 – 150 , 2010 . EXTERNAL
5. Davis S. and Mermelstein P. , “Comparison of Parametric Representations for Monosyllabic Word Recognition in Continuously Spoken Sentences,” IEEE Trans. Acoustics, Speech, Signal Process. , vol. ASSP-28 , no. 4 , 1980 . EXTERNAL
6. Foote J. , “Visualizing Music and Audio Using Self-Similarity,” Proc. ACM Multimedia ‘99 , pp . 77 – 80 , 1999 . EXTERNAL

Source Data (JSON)

Full registry record with provenance metadata. Open directly: /api/doc/10.5594-j18367XY.json

Reference Tree

Explore all references and references to this document, as a navigable tree.

Open Reference Tree

Reference this Doc

Plain text (ISO 690 compliant)

Preview:

Dinkar Bhat; Toward Using Audio for Matching Transcoded Content, SMPTE Motion Imaging Journal ( Volume: 123, Issue: 1, 2014); SMPTE, 2014. Available at https://doi.org/10.5594/j18367XY

Snippet:

Dinkar Bhat; Toward Using Audio for Matching Transcoded Content, SMPTE Motion Imaging Journal ( Volume: 123, Issue: 1, 2014); SMPTE, 2014. Available at https://doi.org/10.5594/j18367XY

HTML (ISO 690 compliant)

Preview:

Dinkar Bhat; Toward Using Audio for Matching Transcoded Content, SMPTE Motion Imaging Journal ( Volume: 123, Issue: 1, 2014); SMPTE, 2014. Available at https://doi.org/10.5594/j18367XY

Snippet:

<span class="citation">Dinkar Bhat; <cite>Toward Using Audio for Matching Transcoded Content</cite>, SMPTE Motion Imaging Journal ( Volume: 123, Issue: 1, 2014); SMPTE, 2014. Available at <a href="https://doi.org/10.5594/j18367XY" target="_blank" rel="noopener">https://doi.org/10.5594/j18367XY</a></span>

SMPTE's HTML Pub

Preview:

Dinkar Bhat; Toward Using Audio for Matching Transcoded Content, SMPTE Motion Imaging Journal ( Volume: 123, Issue: 1, 2014); SMPTE, 2014
doi: 10.5594/j18367XY
url: https://doi.org/10.5594/j18367XY

Snippet:

<li>
Dinkar Bhat; <cite id="bib-10-5594-j18367xy">Toward Using Audio for Matching Transcoded Content</cite>, SMPTE Motion Imaging Journal ( Volume: 123, Issue: 1, 2014); SMPTE, 2014
<span class="doi">10.5594/j18367XY</span>
</li>