Invited Talks.- Building Grid Applications and Portals: An Approach Based on Components, Web Services and Workflow Tools.- Maintaining Thousands of In-flight Instructions.- Some Techniques for Automated, Resource-Aware Distributed and Mobile Computing in a Multi-paradigm Programming System.- Why Structured Parallel Programming Matters.
Topic 1: Support Tools and Environments.- Topic 1: Support Tools and Environments.- Profiling and Tracing OpenMP Applications with POMP Based Monitoring Libraries.- Efficient Pattern Search in Large Traces Through Successive Refinement.- DRAC: Adaptive Control System with Hardware Performance Counters.- Reproducible Network Benchmarks with coNCePTuaL.- Optimizing Cache Access: A Tool for Source-to-Source Transformations and Real-Life Compiler Tests.- Detecting Data Races in Sequential Programs with DIOTA.- A Time-Coherent Model for the Steering of Parallel Simulations.- MATE: Dynamic Performance Tuning Environment.- Imprecise Exceptions in Distributed Parallel Components.
Topic 2: Performance Evaluation.- Topic 2: Performance Evaluation.- Overhead Compensation in Performance Profiling.- A Data Structure Oriented Monitoring Environment for Fortran OpenMP Programs.- An Approach for Symbolic Mapping of Memory References.- Impact of Cache Coherence Models on Performance of OpenMP Applications.- Evaluating OpenMP Performance Analysis Tools with the APART Test Suite.- Collective Communication Performance Analysis Within the Communication System.- Understanding the Behavior and Performance of Non-blocking Communications in MPI.- Generation of Simple Analytical Models for Message Passing Applications.- Parallel Hybrid Particle Simulations Using MPI and OpenMP.- Distributed Shared Memory: To Relax or Not to Relax?.- Design-Time Data-Access Analysis for Parallel Java Programs with Shared-Memory Communication Model.- Parallel PEPS Tool Performance Analysis Using Stochastic Automata Networks.
Topic 3: Scheduling and Load Balancing.- Topic 3: Scheduling and Load Balancing.- Scheduling Under Conditions of Uncertainty: A Bayesian Approach.- From Heterogeneous Task Scheduling to Heterogeneous Mixed Parallel Scheduling.- Scheduling of MPI Applications: Self-co-scheduling.- Scheduling Tasks Sharing Files from Distributed Repositories.- A Scheduling Algorithm for Running Bag-of-Tasks Data Mining Applications on the Grid.- Lookahead Scheduling for Reconfigurable GRID Systems.
Topic 4: Compilers for High Performance.- Topic 4: Compilers for High Performance.- More Legal Transformations for Locality.- Link-Time Optimization of IA64 Binaries.- A Polyhedral Approach to Ease the Composition of Program Transformations.- Compiler-Guided Code Restructuring for Improving Instruction TLB Energy Behavior.- Using Data Compression to Increase Energy Savings in Multi-bank Memories.- Automatic Customization of Embedded Applications for Enhanced Performance and Reduced Power Using Optimizing Compiler Techniques.- Architecture-Independent Meta-optimization by Aggressive Tail Splitting.- If-Conversion in SSA Form.
Topic 5: Parallel and Distributed Databases, Data Mining and Knowledge Discovery.- Topic 5: Parallel and Distributed Databases, Data Mining and Knowledge Discovery.- A Highly Scalable Parallel Caching System for Web Search Engine Results.- OLAP Query Processing in a Database Cluster.- Efficient Parallel Hierarchical Clustering.- Parallelizing EM Clustering Algorithm on a Cluster of SMPs.- A Parallel Knowledge Discovery System for Customer Profiling.- A Large-Scale Digital Library System to Integrate Heterogeneous Data of Distributed Databases.
Topic 6: Grid and Cluster Computing.- Topic 6: Grid and Cluster Computing.- Agreement-Based Interactions for Experimental Science.- A Double Auction Economic Model for Grid Services.- Towards a Monitoring Framework for Worldwide Grid Information Services.- A Model for a Component Based Grid-Aware Scientific Library Service.- Developing Grid-Aware Applications with DRMAA on Globus-Based Grids.- Exposing MPI Applications as Grid Services.- Globus-Based Grid Computing Simulations of Action Potential Propagation on Cardiac Tissues.- Profiling Grid Data Transfer Protocols and Servers.- Storage Power Management for Cluster Servers Using Remote Disk Access.- A Path Selection Based Algorithm for Maximizing Self-satisfiability of Requests in Real-Time Grid Applications.
Topic 7: Applications on High Performance Computers.- Topic 7: Applications on High Performance Computers.- Using a Structured Programming Environment for Parallel Remote Visualization.- Parallel and Grid Computing in 3D Analysis of Large Dimmension Structural Systems.- FlowVR: A Middleware for Large Scale Virtual Reality Applications.
Topic 8: Parallel Computer Architecture and ILP.- Topic 8: Parallel Computer Architecture and Instruction-Level Parallelism.- Exploring the Possibility of Operating in the Compressed Domain.- Management in First-Level Multibanked Data Caches.- A Mechanism for Verifying Data Speculation.- Feasibility of QoS for SMT.- Improving Data Cache Performance via Address Correlation: An Upper Bound Study.- Exploiting Spatial Store Locality Through Permission Caching in Software DSMs.- CacheFlow: A Short-Term Optimal Cache Management Policy for Data Driven Multithreading.- SCISM vs IA-64 Tagging: Differences/Code Density Effects.
Topic 9: Distributed Systems and Algorithms.- Topic 9: Distributed Systems and Algorithms.- Exploiting Differentiated Tuple Distribution in Shared Data Spaces.- Enhancing Efficiency of Byzantine-Tolerant Coordination Protocols via Hash Functions.- Accelerating Apache Farms Through Ad-HOC Distributed Scalable Object Repository.- A Proxy Server-Network for Real-Time Computer Games.
Topic 10: Parallel Programming: Models, Methods and Languages.- Topic 10: Parallel Programming: Models, Methods and Programming Languages.- Comparison of Parallelization Models for Structured Adaptive Mesh Refinement.- Task-Queue Based Hybrid Parallelism: A Case Study.- Co-array Python: A Parallel Extension to the Python Language.- Targeting Heterogeneous Architectures in ASSIST: Experimental Results.- A Fusion-Embedded Skeleton Library.- Cross Component Optimisation in a High Level Category-Based Language.- Concurrent Table Accesses in Parallel Tabled Logic Programs.- Handling Exceptions Between Parallel Objects.- Parallel Software Interoperability by Means of CORBA in the ASSIST Programming Environment.- Designing Parallel Operating Systems via Parallel Programming.
Topic 11: Numerical Algorithms.- Topic 11: Numerical Algorithms.- Reducing the Deteriorating Effect of Old History in Asynchronous Iterations.- Execution Schemes for Parallel Adams Methods.- A Data Management and Communication Layer for Adaptive, Hexahedral FEM.- A Parallel PSPG Finite Element Method for Direct Simulation of Incompressible Flow.- Load and Memory Balanced Mesh Partitioning for a Parallel Envelope Method.- Combining Explicit and Recursive Blocking for Solving Triangular Sylvester-Type Matrix Equations on Distributed Memory Platforms.
Topic 12: High Performance Multimedia.- Topic 12: High Performance Multimedia.- Towards User Transparent Data and Task Parallel Image and Video Processing: An Overview of the Parallel-Horus Project.- Visual Data Rectangular Memory.- Dynamic Granularity Switching in Parallel Block-Matching Motion Compensation.- Placement of Nodes in an Adaptive Distributed Multimedia Server.- A Practical Performance Analysis of Stream Reuse Techniques in Peer-to-Peer VoD Systems.- Supporting Caching and Mirroring in Distributed Video-on-Demand Architectures.- An Efficient Fair Queuing Algorithm for Guaranteed Disk Bandwidth.
Topic 13: Theory and Algorithms for Parallel Computation.- Topic 13: Theory and Algorithms for Parallel Computation.- Distributed Largest-First Algorithm for Graph Coloring.- Parallel Delaunay Refinement with Off-Centers.- Near-Optimal Hot-Potato Routing on Trees.- A Coarse-Grained Parallel Algorithm for Spanning Tree and Connected Components.
Topic 14: Routing and Communication in Interconnection Networks.- Topic 14: Routing and Communication in Interconnection Networks.- Implementing MPI on the BlueGene/L Supercomputer.- Modular On-chip Multiprocessor for Routing Applications.- OMULT: An Optical Interconnection System for Parallel Computing.- Distributing InfiniBand Forwarding Tables.- Tuning Buffer Size in InfiniBand to Guarantee QoS.- Reducing Power Consumption in Interconnection Networks by Dynamically Adjusting Link Width.- A Methodology to Evaluate the Effectiveness of Traffic Balancing Algorithms.- Load Unbalance in k-ary n-Cube Networks.- Understanding Buffer Management for Cut-Through 1D Rings.
Topic 15: Mobile Computing.- Topic 15: Mobile Computing.- Coverage and Connectivity in Networks with Directional Sensors.- Local Route Recovery Algorithms for Improving Multihop TCP Performance in Ad Hoc Wireless Networks.- Cluster Based Distributed Mutual Exclusion Algorithms for Mobile Networks.- An Improved Handoff Technique for a Seamless Multimedia Services.- An Improved Approximation Algorithm for the Minimum Energy Consumption Broadcast Subgraph.- UVOD: A Jini-Based Ubiquitous VOD Service Architecture.
Topic 16: Integrated Problem Solving Environments.- Topic 16: Integrated Problem Solving Environments.- Pattern/Operator Based Problem Solving Environments.- Databases, Workflows and the Grid in a Service Oriented Environment.- A Parallel Programming Tool for SAR Processors.
Topic 17: High Performance Bioinformatics.- Topic 17: High Performance Bioinformatics.- A Generic Parallel Pattern-Based System for Bioinformatics.- Parallel Inference of a 10.000-Taxon Phylogeny with Maximum Likelihood.- Cache-Based Parallelization of Multiple Sequence Alignment Problem.
Topic 18: Peer-to-Peer and Web Computing.- Topic 18: Peer-to-Peer and Web Computing.- The ncast Primitive for Peer-to-Peer Networks.- A P2P Grid Services-Based Protocol: Design and Evaluation.- A Synchronization and Session Control Based Protocol for 3D Multi-user Virtual Environments in a Peer-to-Peer Networks.- Large-Scale Deployment in P2P Experiments Using the JXTA Distributed Framework.- Experimental Study of Multi-criteria Scheduling Heuristics for GridRPC Systems.- Using Similarity Groups to Increase Performance of P2P Computing.- Improving the Scalability of Logarithmic-Degree DHT-Based Peer-to-Peer Networks.- A Small World Overlay Network for Resource Discovery.