Tutorial T3
T3: Architecting fault-tolerant systems
Thursday 25th September, 13:30 - 17:00
Tutorial Leaders
Henry Muccini (IT)
Patrizio Pelliccione (IT)
Alexander Romanovsky (UK)
Description
Fault tolerance, being one of the four means for achieving dependability, is intended to ensure the delivery of the correct service in the presence of active faults. While typical fault tolerance solutions aim at extending the design and implementation phases of the software life-cycle (e.g., Java and Windows NT exception handling), more recently the need for explicit integration of fault tolerance solutions into the entire life cycle has been advocated by some researchers. For example, several solutions have been proposed for fault tolerance using exception handling at the software architecture and component levels. This tutorial describes how the concepts of fault tolerance and software architectures have been integrated so far. The tutorial is structured in five parts (Overview of the Software Architecture Domain, Overview of Fault Tolerance and Exception Handling, Integrating Fault Tolerance into Software Architecture, Coordinated Atomic Actions, Examples and Case Studies) and is based on our recent survey study in which more than fifteen approaches to architecting fault tolerant systems have been analysed and classified. The tutorial concludes by identifying the issues that are still open and require further investigation.
The tutorial has been structured to attract a wide range of software engineers. It does not require any specific knowledge on fault tolerance and exception handling. In order to provide a commonly understood definition of software architecture, the talk will introduce an initial informal meaning of software architecture. Tutorial attendants will receive in advance slides that will be presented during the tutorial. They will receive also useful bibliographies and links for further reading.
More information can be found at http://www.di.univaq.it/TutorialAFTS/
Short Bios
Henry Muccini is an Assistant professor in Computer Science at the University of L'Aquila, Italy. He got his PhD from the University of Rome - La Sapienza (Italy) and has been visiting assistant professor at Information & Computer Science, University of California, Irvine. His main research areas are on verification and validation, software architecture-based modeling and analysis, model-based testing, and fault tolerance. He has published over 50 conference and journal articles on these topics. He recently co-edited a book on "Software Engineering of Fault Tolerant Systems" and is currently co-editing another book on "Architecting Dependable System", vol. V. His research and development collaborations are with some research institutes and companies. Henry Muccini teaches different courses at the University of L'Aquila (Analysis and Testing of Component-based Systems and Architectures, Modeling Web Applications with the UML), he has tought courses on Software Quality and Component-based systems at the University of California Irvine.
Patrizio Pelliccione is an assistant professor at the University of L'Aquila, Computer Science Department. He got his PhD degree in the University of L'Aquila, computer science department, advisor Paola Inverardi, and thesis titled: "CHARMY: A framework for Software Architecture Specification and Analysis". Currently Patrizio is involved in the european project MANCOOSI (Managing the Complexity of the Open Source Infrastructure) of the Seventh Research Framework Programme (FP7), in the European project POPEYE (Professional Peer Environment beYond Edge computing) of the Sixth Framework Programme Priority 2, Information Society Technologies (FP6-2005- IST-5) and in the national project ArtDeco (Adaptive InfRasTructures for DECentralized Organizations), a three years Italian project - FIRB 2005. From April 2005 to April 2006 Patrizio was Senior Researcher at the Faculty of Sciences, Technologies and Communications of the University of Luxembourg. He was coordinator of the CORRECT project and Group that is part of the Software Engineering Competence Center (SE2C) created since 2001 by Prof. Nicolas Guelfi. CORRECT (rigorous stepwise development of Complex Fault TOlerant DistRibuted Systems: from ARchitEctural DesCription to Java ImplemenTation) is a three year (2004-2006) project entirely funded by Luxembourg Ministry of Higher Education and Research (ref. n. MEN/IST/04/04). Patrizio is author of more than 35 publications in international journals and conferences. The research topics are mainly in Software Architectures, Software Architectures Analysis, Component-based systems, Fault-tolerance, Middleware, Model checking, Formal Methods. In its research activity Patrizio collaborated with several industries such as Selex Marconi telecommunications, Ericsson, Siemens, TERMA, etc. Patrizio is chair of the ERCIM international workshop on Software Engineering for Resilient Systems (SERENE), is editor of a book: Software Engineering of Fault Tolerant Systems, and is reviewer of several workshops, conferences and journals.
Alexander (Sascha) Romanovsky is a specialist in system dependability and a Professor of Computer Science in Newcastle University. He received a M.Sc. degree in Applied Mathematics from Moscow State University and a PhD degree in Computer Science from St. Petersburg State Technical University. He was with this University from 1984 until 1996, doing research and teaching. In 1991 he worked as a visiting researcher at ABB Ltd Computer Architecture Lab Research Center, Switzerland. In 1993 he was a visiting fellow at Istituto di Elaborazione della Informazione, CNR, Pisa, Italy. In 1992-1998 he was involved in the Predictably Dependable Computing Systems (PDCS) ESPRIT Basic Research Action and the Design for Validation (DeVa) ESPRIT Basic Project. In 1998-2000 he worked on the Diversity in Safety Critical Software (DISCS) EPSRC/UK Project. Prof Romanovsky was a co-author of the Diversity with Off-The-Shelf Components (DOTS) EPSRC/UK Project and was involved in this project in 2001-2004. In 2004-2007 he coordinated Rigorous Open Development Environment for Complex Systems (RODIN) IST Project (2004-2007). He is now the Director of the major FP7 ICT integrated project on Industrial Deployment of System Engineering Methods Providing High Dependability and Productivity (DEPLOY). His main research interested are in fault tolerance, error recovery, exception handling, software architecture, rigorous design of fault tolerant systems, resilient system engineering.