000 04512nam a22006255i 4500
001 978-3-540-30568-2
003 DE-He213
005 20240423125758.0
007 cr nn 008mamaa
008 100704s2005 gw | s |||| 0|eng d
020 _a9783540305682
_9978-3-540-30568-2
024 7 _a10.1007/b105752
_2doi
050 4 _aQA76.9.U83
050 4 _aQA76.9.H85
072 7 _aUYZ
_2bicssc
072 7 _aCOM079010
_2bisacsh
072 7 _aUYZ
_2thema
082 0 4 _a005.437
_223
082 0 4 _a004.019
_223
245 1 0 _aMachine Learning for Multimodal Interaction
_h[electronic resource] :
_bFirst International Workshop, MLMI 2004, Martigny, Switzerland, June 21-23, 2004, Revised Selected Papers /
_cedited by Samy Bengio, Hervé Bourlard.
250 _a1st ed. 2005.
264 1 _aBerlin, Heidelberg :
_bSpringer Berlin Heidelberg :
_bImprint: Springer,
_c2005.
300 _aXII, 362 p.
_bonline resource.
336 _atext
_btxt
_2rdacontent
337 _acomputer
_bc
_2rdamedia
338 _aonline resource
_bcr
_2rdacarrier
347 _atext file
_bPDF
_2rda
490 1 _aInformation Systems and Applications, incl. Internet/Web, and HCI,
_x2946-1642 ;
_v3361
505 0 _aMLMI 2004 -- Accessing Multimodal Meeting Data: Systems, Problems and Possibilities -- Browsing Recorded Meetings with Ferret -- Meeting Modelling in the Context of Multimodal Research -- Artificial Companions -- Zakim – A Multimodal Software System for Large-Scale Teleconferencing -- Towards Computer Understanding of Human Interactions -- Multistream Dynamic Bayesian Network for Meeting Segmentation -- Using Static Documents as Structured and Thematic Interfaces to Multimedia Meeting Archives -- An Integrated Framework for the Management of Video Collection -- The NITE XML Toolkit Meets the ICSI Meeting Corpus: Import, Annotation, and Browsing -- S-SEER: Selective Perception in a Multimodal Office Activity Recognition System -- Mapping from Speech to Images Using Continuous State Space Models -- An Online Algorithm for Hierarchical Phoneme Classification -- Towards Predicting Optimal Fusion Candidates: A Case Study on Biometric Authentication Tasks -- Mixture of SVMs for Face Class Modeling -- AV16.3: An Audio-Visual Corpus for Speaker Localization and Tracking -- The 2004 ICSI-SRI-UW Meeting Recognition System -- On the Adequacy of Baseform Pronunciations and Pronunciation Variants -- Tandem Connectionist Feature Extraction for Conversational Speech Recognition -- Long-Term Temporal Features for Conversational Speech Recognition -- Speaker Indexing in Audio Archives Using Gaussian Mixture Scoring Simulation -- Speech Transcription and Spoken Document Retrieval in Finnish -- A Mixed-Lingual Phonological Component Which Drives the Statistical Prosody Control of a Polyglot TTS Synthesis System -- Shallow Dialogue Processing Using Machine Learning Algorithms (or Not) -- ARCHIVUS: A System for Accessing the Content of Recorded Multimodal Meetings -- Piecing Together the Emotion Jigsaw -- EmotionAnalysis in Man-Machine Interaction Systems -- A Hierarchical System for Recognition, Tracking and Pose Estimation -- Automatic Pedestrian Tracking Using Discrete Choice Models and Image Correlation Techniques -- A Shape Based, Viewpoint Invariant Local Descriptor.
650 0 _aUser interfaces (Computer systems).
650 0 _aHuman-computer interaction.
650 0 _aArtificial intelligence.
650 0 _aNatural language processing (Computer science).
650 0 _aComputers and civilization.
650 0 _aComputer vision.
650 1 4 _aUser Interfaces and Human Computer Interaction.
650 2 4 _aArtificial Intelligence.
650 2 4 _aNatural Language Processing (NLP).
650 2 4 _aComputers and Society.
650 2 4 _aComputer Vision.
700 1 _aBengio, Samy.
_eeditor.
_4edt
_4http://id.loc.gov/vocabulary/relators/edt
700 1 _aBourlard, Hervé.
_eeditor.
_4edt
_4http://id.loc.gov/vocabulary/relators/edt
710 2 _aSpringerLink (Online service)
773 0 _tSpringer Nature eBook
776 0 8 _iPrinted edition:
_z9783540245094
776 0 8 _iPrinted edition:
_z9783540807483
830 0 _aInformation Systems and Applications, incl. Internet/Web, and HCI,
_x2946-1642 ;
_v3361
856 4 0 _uhttps://doi.org/10.1007/b105752
912 _aZDB-2-SCS
912 _aZDB-2-SXCS
912 _aZDB-2-LNC
942 _cSPRINGER
999 _c181606
_d181606