Publications

 

Koutsombogera M. and Papageorgiou H. 2011. Iconic Gestures in face-to-face TV interviews, in Gesture in Embodied Communication and Human-Computer Interaction, GW 2011, Athens, pp. 147-151.

Abstract. This paper presents a study of iconic gestures as attested in a corpus of Greek face-to-face television interviews. This study takes place in an interactional context that is different from the narrative genre that has been primarily examined so far regarding the semantics and the communicative significance of the iconic gestures. We attempt to classify the iconic gestures attested according to their semantic equivalent, and link them to the phrase units of the accompanying speech, in order to draw some conclusions about the actual syntactical structures that induce them.

Kalimeri M., Constantoudis V., Papadimitriou C., Karamanos K., Diakonos F., Papageorgiou H. 2011. Entropy analysis of word-length series of natural language texts, accepted for publication in the International Journal of Bifurcation and Chaos.

Abstract. We estimate the n-gram entropies of natural language texts in word-length representation and find that these are sensitive to text language and genre. We attribute this sensitivity to changes in the probability distribution of the lengths of single words and emphasize the crucial role of the uniformity of probabilities of having words with length between five and ten. Furthermore, comparison with the entropies of shuffled data reveals the impact of word length correlations on the estimated n-gram entropies.

Koutsombogera M., Ammendrup S. M., Vilhjálmsson H. H., and Papageorgiou H. 2011. Non-verbal expressions of turn management in TV interviews: a cross-cultural study between Greek and Icelandic, in A. Esposito et al. (Eds.): Toward Autonomous, Adaptive, and Context-Aware Multimodal Interfaces. Theoretical and Practical Issues. Lecture Notes in Computer Science, Volume 6456, 2011, pp. 207-213.

Abstract. In this paper we discuss a cross-cultural analysis of non-verbal expressions (gestures, facial expressions, body posture) that have a turn managing function in the flow of interaction. The study was carried out by analyzing and comparing the features of interest in two samples of institutional interaction, namely face-to-face political interviews, in Greek and Icelandic respectively. The non-verbal behavior of the participants in both interviews was annotated following the same annotation process. The attested turn management instances were compared in order to find similarities and differences in terms of frequency and modality preference.

Gkoufas Y., Morou A. and Kalamboukis T. 2011. Combining Textual and Visual Information for Image Retrieval in the Medical Domain, in The Open Medical Informatics Journal, 2011, 5, 50-57.

Abstract: In this article we have assembled the experience obtained from our participation in the ImageCLEF evaluation task over the past two years. Exploitation on the use of linear combinations for image retrieval has been attempted by combining visual and textual sources of images. From our experiments we conclude that a mixed retrieval technique that applies both textual and visual retrieval in an interchangeably repeated manner improves the performance while overcoming the scalability limitations of visual retrieval. In particular, the mean average precision (MAP) has increased from 0.01 to 0.15 and 0.087 for 2009 and 2010 data, respectively, when content-based image retrieval (CBIR) is performed on the top 1000 results from textual retrieval based on natural language processing (NLP).

Papadimitriou C., Karamanos K., Diakonos F.K., Constantoudis V. and Papageorgiou H. 2010. Entropy analysis of natural language written texts, in Physica A: Statistical Mechanics and its Applications, article in press.

Abstract: The aim of the present work is to investigate the relative contribution of ordered and stochastic components in natural written texts and examine the influence of text category and language on these. To this end, a binary representation of written texts and the generated symbolic sequences are examined by standard block entropy analysis and the Shannon and Kolmogorov entropies are obtained. It is found that both entropies are sensitive to both language and text category with the text category sensitivity following almost the same trends in both languages (English and Greek). The values of these entropies are compared with those of stochastically generated symbolic sequences and the nature of correlations present in this representation of real written texts is identified.

Koutsombogera M. and Papageorgiou H. 2009. Multimodality Issues in Conversation Analysis of Greek TV Interviews, in A. Esposito et al. (Eds.): Multimodal Signals: Cognitive and Algorithmic Issues. Lecture Notes in Artificial Intelligence, 5398: 40-46, Springer-Verlag, Berlin Heidelberg, 2009.

Abstract: This paper presents a study on multimodal conversation analysis of Greek TV interviews. Specifically, we examine the type of facial, hand and body gestures and their respective communicative functions in terms of feedback and turn management. Taking into account previous work on the analysis of non-verbal interaction, we describe the tools and the coding scheme employed, we discuss the distribution of the features of interest and we investigate the effect of the situational and conversational interview setting on the interactional behavior of the participants. Finally, we conclude with comments on future work and exploitation of the resulting resource.

Demiros I., Carayannis G., Antonopoulos V., Kambourakis G., Katsouros V., Kolevris P., Nottas M., Papageorgiou H., Papavasiliou V., Raptis S., Simistira F., Stafylakis T. 2008. PANOPTIS: A System for Intelligent Monitoring of the Hellenic Broadcast Sector, First International Workshop on Automated Information Extraction in Media Production - AIEMPro08, Turin, Italy, 2008.

Abstract: In this paper we describe a system that applies emerging technologies for speech recognition, language processing, multimedia indexing and retrieval, all integrated into a large video and audio library that covers broadcast news and current affairs in Greece. It assists the Greek National Council for Radio and Television (NCRTV) in compiling information, annotating and analyzing news and monitoring national, political, social, economic, cultural and environmental issues concerning Greece in general. It further assists supervision of the broadcast A/V sector by offering citizens an efficient way to seek and get hold of an ‘official’ copy of aired programming, in order to safeguard their interests and promote NCRTV’s goals.

Demiros I., Papageorgiou H., Antonopoulos V., Pipis A., Skoulariki A. 2008. Media Monitoring by Means of Speech and Language Indexing for Political Analysis, in Journal of Information Technology and Politics, vol. 5, no. 1, Spring 2008.

Abstract: In this paper, we describe a media monitoring system that we have developed and implemented for the Secretariat General of Communication and Secretariat General of Information in Greece (SGC-SGI). The system applies emerging technologies for audiovisual recording, speech recognition, language processing, multimedia indexing and retrieval, all integrated into a large video and audio library that covers broadcast news and current affairs in Greek and English. It assists SGC-SGI in compiling information, annotating and analyzing news and monitoring national, political, social, economic, cultural and environmental issues concerning Greece in general.

Lambropoulou P., Papageorgiou H., Georgantopoulos B., Tsagogeorga D., Demiros I., Antonopoulos V. 2008. Integrating language technology in a web enabled cultural heritage system, LREC 2008 Workshop on Language Technology for Cultural Heritage Data, June 2008, Marrakech, Morocco.

Abstract: This paper describes a web-enabled sophisticated Cultural Heritage (CH) system giving access to digital resources of various media, which exploits Language Technologies (LT) in order to enhance the performance of the search and retrieval mechanisms. More specifically, the paper presents the system requirements and architecture, drawing aspects from: (a) the cultural data repository and its particularities; (b) the unified metadata scheme that has been devised, integrating elements from various metadata standards, providing thus a rich description of the resources; (c) the thesauri (one of the major pillars of the system) that provide uniform access to the resources. The LT that form part of the system construction and use are presented in detail, focusing on the Term Extraction and Named Entity Recognition tools used in the construction of the thesauri and the metadata annotation process, and the Term Matching module exploited in the mining process for the identification of query terms which appear in a morphosyntactically similar form in the thesauri.

Papageorgiou H., Antonopoulos V., Demiros I., Gkiokas A. 2006. Thematic Classification and Intelligent Indexing of Broadcast News Using Speech Recognition and Image Analysis, EuroITV 2006, Athens, Greece.

Abstract: This paper addresses the development of a robust Multimedia Content Management System (MUSE) for analysis, indexing, retrieval and classification of large amounts of audiovisual content related to business news, aiming at personalized delivery of content as well as associated metadata over multiple devices. The system is a powerful tool in the hands of the world of media and television, video, news broadcasting, show business, advertisement, and any organization that produces, markets and/or broadcasts video and audio programs, facilitating common procedures of retrieving audio-visual material during research and production.

Papageorgiou H., Prokopidis P., Protopapas A., and Carayannis G. 2005. Multimedia Indexing and retrieval using Natural Language, Speech and Image Processing methods, in Giorgos Stamou & Stefanos Kollias (Eds.), Multimedia Content and Semantic Web: Methods, Standards and Tools, Wiley, 2005, Chapter 11, pp. 279-297.

Abstract: Throughout the chapter, we provide details on implementation issues of practical systems for efficient multimedia retrieval. Moreover, we exemplify algorithms and technologies by referring to practices and results of an EC-funded project called Combined IMage and WOrd Spotting (CIMWOS) developed with the hope that it would be a powerful tool in facilitating common procedures for intelligent indexing and retrieval of audiovisual material. CIMWOS used a multifaceted approach for the location of important segments within multimedia material employing state-of-the-art algorithms for text, speech and image processing in promoting reuse of audiovisual resources and reducing budgets of new productions. We focus on technologies specific to speech, text and image, respectively. These technologies incorporate efficient algorithms for processing and analysing relevant portions from various digital media and thus generating high-level semantic descriptors in the metadata space. After proposing an architecture for the integration of all the results of processing, we present indicative evaluation results in the context of CIMWOS.

Papageorgiou H., Prokopidis P., Demiros I., Hatzigeorgiou N., Carayannis G. 2004. CIMWOS: A Multimedia Retrieval System based on Combined Text, Speech and Image Processing, RIAO 2004, Coupling Approaches, Coupling Media and Coupling Languages for Information Retrieval, University of Avignon (Vaucluse), France.

Abstract: In this paper, we present a multimedia, multimodal and multilingual system that has been developed in the framework of the CIMWOS project, supporting content-based indexing, archiving, retrieval and on-demand delivery of audiovisual content. We have conducted research on emerging technologies for multimedia data processing, indexing and retrieval, and embedded them in a video library system that covers sports, broadcast news and documentaries in English, French and Greek. The CIMWOS Digital Video Library uses intelligent, automatic mechanisms that provide full-content search and retrieval from a large (scaling to several hours) online digital video library. The tools that we have developed during the project can automatically populate the library and support access to it. The approach that we follow uses combined speech, language and image understanding technology to automatically transcribe, segment and index the video.

Antonopoulos V., Demiros I., Carayannis G., Piperidis S. 2004. Integrating Translation Technologies Towards a Powerful Translation Web Service, 2004 IEEE Conference on Cybernetics and Intelligent Systems, Singapore.

Abstract: Rapid changes in the global marketplace have given rise to new demands and have provided new opportunities for the translation industry. The need for multilinguality in the presentation and business logic layers of most modern systems, applications and services is a great challenge that the translation industry now faces. But even after many years of intense research and many commercial attempts of related products, today's translation systems still fail to completely meet the above needs. Within this framework, an architecture of a modern automatic translation system exploiting current infrastructure and covering present and future needs is proposed in this paper.

Demiros I., Antonopoulos V., Georgantopoulos B., Triantafyllou Y., Piperidis S. 2001. Connectionist models for sentence-based text extracts, IEEE International Workshop on Natural Language Processing and Knowledge Engineering (NLPKE-2001), in conjunction with the IEEE International Conference on Systems, Man, and Cybernetics (SMC 2001), Tucson, USA.

Abstract: This paper addresses the problem of creating a summary by extracting a set of sentences that are likely to represent the content of a document. A small scale experiment is conducted leading to the compilation of an evaluation corpus for the Greek language. Two models of sentence extraction are then described, along the lines of shallow linguistic analysis, feature combination and machine learning. Both models are based on term extraction and statistical filtering. After extracting the individual features of the text, we apply them to two neural networks that classify each sentence depending on its feature vector, the term weight being the feature with the best discriminant capacity. A three-layer feedforward network trained with the highly popular backpropagation algorithm and a competitive learning self-organizing map characterized by the formation of a topographic map, both trained on a small manually annotated corpus of summaries, perform the sentence extraction task. Both methods could be used for rapid light information retrieval-oriented summarization.
