Toward Using Audio for Matching Transcoded Content
Metadata
- Publisher
- SMPTE
- Doc Type
- Journal Article
- Article Type
- orig-research
- Abstract
- With the advent of multiple screens for viewing media, transcoding is becoming a key component of content delivery ecosystems. But transcoding implies that copies and versions of the same content can proliferate across various storage devices. It also means that keeping track of content becomes a major problem from both copyright and recording/indexing perspectives. Video-based techniques for content indexing, where the aim is to extract robust signatures from video, have emerged as a major area of research. On the other hand, audio-based techniques have received less focus, but audio could provide robust signatures for indexing media while it undergoes transformations. This paper presents an investigation of audio signatures under typical transcoding operations. Specifically, mel-frequency cepstral coefficients (MFCCs) are examined as a signature, which has been widely used in audio recognition systems. Initial results indicate that MFCCs are quite robust.
- Publication Date
- 2014-01-01
- DOI
10.5594/j18367XY- Link
- https://doi.org/10.5594/j18367XY
- Author(s)
- Dinkar Bhat
Source Data (JSON)
Full registry record with provenance metadata. Open directly: /api/doc/10.5594-j18367XY.json
Reference this Doc
Plain text (ISO 690 compliant)
Preview:
Dinkar Bhat; Toward Using Audio for Matching Transcoded Content, SMPTE Motion Imaging Journal ( Volume: 123, Issue: 1, 2014); SMPTE, 2014. Available at https://doi.org/10.5594/j18367XY
Snippet:
Dinkar Bhat; Toward Using Audio for Matching Transcoded Content, SMPTE Motion Imaging Journal ( Volume: 123, Issue: 1, 2014); SMPTE, 2014. Available at https://doi.org/10.5594/j18367XY
HTML (ISO 690 compliant)
Preview:
Dinkar Bhat; Toward Using Audio for Matching Transcoded Content, SMPTE Motion Imaging Journal ( Volume: 123, Issue: 1, 2014); SMPTE, 2014. Available at https://doi.org/10.5594/j18367XY
Snippet:
<span class="citation">Dinkar Bhat; <cite>Toward Using Audio for Matching Transcoded Content</cite>, SMPTE Motion Imaging Journal ( Volume: 123, Issue: 1, 2014); SMPTE, 2014. Available at <a href="https://doi.org/10.5594/j18367XY" target="_blank" rel="noopener">https://doi.org/10.5594/j18367XY</a></span>
SMPTE's HTML Pub
Preview:
Dinkar Bhat; Toward Using Audio for Matching Transcoded Content, SMPTE Motion Imaging Journal ( Volume: 123, Issue: 1, 2014); SMPTE, 2014
doi: 10.5594/j18367XY
url: https://doi.org/10.5594/j18367XY
doi: 10.5594/j18367XY
url: https://doi.org/10.5594/j18367XY
Snippet:
<li> Dinkar Bhat; <cite id="bib-10-5594-j18367xy">Toward Using Audio for Matching Transcoded Content</cite>, SMPTE Motion Imaging Journal ( Volume: 123, Issue: 1, 2014); SMPTE, 2014 <span class="doi">10.5594/j18367XY</span> </li>