Institute of Perception, Action and Behaviour

Seminar Abstracts

Tom Larkworthy, Characterization of Self-Reconfiguring System State Spaces

Self-Reconfiguring Systems (SRS)(a.k.a. Transformers) offer various advantages over fixed shaped robots, but to realize these advantages a motion planning method is needed to control the self-reconfiguration process. SRSs have the ability to be scaled to systems with the tens of thousands of degrees of freedom, which far exceeds the capabilities of traditional planning approaches. Yet SRS are typically thousands of repeated modules on some uniform embedded space, symmetry is everywhere, so surely there exists an exploitable sub-structure?
One efficient approach has been to plan in groups of meta-modules. With suitable meta-module motion primitives designed by hand, the reconfiguration state space for the coarse meta-modules is simpler than the underlying model's cumbersome motion constraints. A drawback is that resolution is lost by the definition of axis aligned meta-modules. We have a better approach that adds just enough constraints to hide the difficult motion primitives, leading to a near linear algorithm for the hexagonal metamorphic robot. The planning simplifies for a sub-space, but by sacrificing less.
Planning efficiently for a highly abstracted SRS like the 2D hexagonal metamorphic robot is not particularly useful though. We need to understand *why* our approach yields the observed benefits. We have had some significant success de-mystifying the process of adding constraints to an SRS motion model to create a new motion model that is easier to plan with. In particular:
1. Let R = a raw SRS reconfiguration state space and C some other. C is a further constrained version of R iff R < C where < denotes the graph minor relation. The graph minor relation explains why some meta-module state spaces can be run on some underlying model. It concretely defines what is meant by sub-space in the SRS context.
2. Let C_n denote the reconfiguration state space for some model where n denotes the number of units in the configuration. Motion models that afford efficient planning solutions (e.g. meta-modularization and our new algorithm) are well ordered by the graph minor relation i.e. C_n < C_(n+1). Well-ordered states spaces by the graph minor relation explain why some spaces can be tackled in a recursive, local and iterative manner (all of which suggest the existence of some efficient planning strategy)
3. Models that are efficient to plan with are highly connected and do not contain bottlenecks (in a specific sense). High connectivity explains why a greedy planning methodology suffices for certain sub-spaces.
Thus we now understand when one state-space is a sub-space of another, and when a particular sub-space affords efficient planning strategies. These are broad rigorous principles that should inform SRS motion planning algorithm design across different SRS architectures. The door is now open to an SRS motion meta-planner.

Andreas Andreou, A micro-Doppler Sonar for Acoustic Scene Analysis

Fundamental to natural cognitive systems is the ability to detect and differentiate other living creatures in the world and to characterize their behavior. The spatio-temporal patterns of the body and its articulated components provide behavioral signatures and a means of communication among individuals within the environment. Sound is the primary medium for long distance passive and active interaction between animals and between animals and their environment; ranging from human speech communication to the active auditory scene analysis of bats and dolphins using bio-sonar. A component of the research in my lab is aimed bio-inspired autonomous acoustic scene analysis and decision making. In this talk I will introduce briefly a software/hardware a distributed sensor networked system [1], that is capable of forming composite representations of animate entities in the world exclusively through the use of information derived from sounds. The system employs sound in two ways. Firstly, actively through the emission, detection and processing of micro-Doppler sonar signals, the system is able to detect, identify and classify moving articulated objects in the environment. Secondly, passively through the processing and categorization of sounds emitted by the objects themselves, the system learns to recognize the acoustic communications of living entities and to associate these messages with their detected behavior. This situated cognitive system thus goes beyond human capabilities and is essentially an acoustic analogy to a camera-based visual scene analysis system; one which is particularly suited to detecting the presence and characterizing the behavior of living entities.
The bulk of the talk will focus on the micro-Doppler sonar system [2] for imaging moving articulated objects in the environment. The device is inspired by natural bio-sonar which echo-locating animals can use to locate, range and identify objects in their environment. Unlike much bio-sonar-inspired robotics work, which has primarily focused on object identification and navigation, we use it to acquire signatures that are employed in learning and classification of spectro-temporal patterns which characterize explicit movements. These patterns are detected as modulations of the frequency of the emitted sonar signal. Sonar technology complements cameras and visual surveillance in situations where the mere presence of life is relevant (for security or search and rescue reasons, for example), since although it depends upon a clear line 'of sight' between the detector and the object of interest it does not rely on visibility per se.
The velocity of a moving object relative to an observer can be estimated by measuring the frequency shift of a wave radiated or scattered by the object, known as the Doppler effect. If the object itself contains moving parts, each moving part will result in a modulation of the base shift (the micro- Doppler effect). For example, the frequency spectrum of acoustic or electromagnetic waves scattered from a walking person is a complex time-frequency representation of human gait. It includes not only the Doppler shifted components from the velocity of the entire body but also the micro-Doppler components from the motion of the arms and legs. In the case of an articulated body such as a walking person, the torso, each arm, and each leg has its own velocity, and even when the torso's velocity is constant, the velocity of the limbs changes over time. The Doppler signature for such a complex object has multiple time-dependent frequency shifted components corresponding to the velocity of the torso or an individual limb as a function of time. A two-dimensional representation of human gait can be obtained from the returned Doppler signal by applying a short-time Fourier transform (STFT) to the received signal

[1] D.H. Goldberg, A.G. Andreou, P. Julian, P.O. Pouliquen, L. Riddle and R. Rosasco, Algorithm and VLSI implementation of a wake-up subsystem for an acoustic surveillance sensor network, ACM Transactions on Sensor Networks, Vol. 2, No. 4, pp. 594-611, November 2006.
[2] Z. Zhang, P.O. Pouliquen, A. Waxman and A.G. Andreou, Acoustic micro-Doppler radar for human gait imaging, Journal of the Acoustical Society of America Express Letters, Vol. 121, No. 3, pp. 110-113, March 2007.

Paulina Varshavskaya, Modeling Team Tactics for Sports Science and Robocup

I will present ongoing work in modeling and learning tactical play patterns in team sports such as football. The problem is to represent, and learn from video demonstrations, implicit coordination between team players in an adversarial situation. This representation should model the abstract, invariant essence of the play tactic, while discarding any irrelevant specifics such as exact player positions on the pitch. We have two goals in mind. On the one hand, it should enable us to automatically extract, store and compare patterns and instances of team decision-making, to be used in sports science. On the other hand, the model should be able to generate player behavior in the context of a game, to be used for high-level control in robot soccer. We treat this problem as hidden state estimation in a Dynamic Bayes Net. In this talk, I will go over the problem and relevant prior work in machine vision, behavior analysis and robotics. I will also present our first experiments in developing these models and very preliminary results.
This work is part of the IDEAlab project "Automating and Enhancing Team Sports Performance Analysis".

Jan-Peter Calliess, No-Regret Learning and a Mechanism for Distributed Multiagent Planning

In this talk I will outline the conceptual ideas of a novel mechanism for coordinated, distributed multiagent planning. We considered problems stated as a collection of single-agent planning problems coupled by common soft constraints on resource consumption. A key idea is to recast the distributed planning problem as learning in a repeated game between the original agents and a newly introduced group of adversarial agents who influence prices for the resources. The adversarial agents are set up to benefit from arbitrage: that is, their incentive is to uncover violations of the resource usage constraints and, by selfishly adjusting prices, encourage the original agents to avoid plans that cause such violations. If all agents employ no-regret learning algorithms in the course of this repeated interaction, we are able to show that our mechanism can be set up to achieve design goals such as social optimality and Nash-equilibrium convergence to within an error which approaches zero as the agents gain experience. In particular, the agents' average plans converge to a socially optimal solution for the original planning task. As an illustrating application, we consider a multiagent-based source routing task that can be successfully solved with our coordination mechanism.

Yijun Xiao, 3D shape acquisition of objects in high motion using a stereo vision sensor

High-speed 3D shape acquisition is a cutting-edge research with many potential applications. In this talk, I'll introduce an application in bat behaviour study in the context of the EU CHIROPING project. First I'll give an overview of the project and describe the work to be carried out in Edinburgh. Then I'll talk about the 3D sensor we employed and present an empirical study of performance evaluation of the sensor. Results from real bat shape data we collected in Denmark and Panama will be demonstrated and discussed.

Joanna Young, Olfactory associative learning and locomotion in the fruit fly

In this talk I will give an overview of two projects that I am currently working on in the laboratory using the fruit fly Drosophila. Firstly, I will show some recent results on olfactory associative learning and secondly I will describe a project investigating a brain region called the Central Complex, which is thought to be involved in the higher control of locomotion in the fly.

Matt Howard, Transferring Impedance Control Strategies via Apprenticeship Learning

In this talk, I will describe my recent research in the direction of designing biomimetic controllers for variable impedance actuators in the context of the EU STIFF project. Specifically, I will present an imitation learning approach, whereby the goal is to learn impedance modulation strategies from recordings of behaviour (for example, that of humans) and transfer these to a robotic plant with very different actuators and dynamics. In contrast to previous approaches, where impedance characteristics are directly imitated, the method we propose uses task performance as the metric of imitation, ensuring that the learnt controllers are directly optimised for the hardware of the imitator. As a key ingredient, apprenticeship learning is used to model the optimisation criteria underlying observed behaviour, in order to frame a correspondent optimal control problem for the imitator. Using local optimal feedback control techniques, we can then find an appropriate impedance modulation strategy under the imitator's dynamics. I'll present some recent experiments testing the performance of the approach for transferring behaviour between systems with antagonistic actuation (including a biologically realistic two- joint, six-muscle model of the human arm) to robotic systems with controllable active or passive impedance.

Lucia Ballerini, Appearance based skin cancer diagnosis

DERMOFIT is a Wellcome Foundation funded research project. One goal of the project is develop a tool that will allow non-experts to diagnose skin lesions by taking advantage of the ability of humans to make visual matches even when they are not able to describe the lesions (using words) in a consistent way. In this talk I'll present our work within the DERMOFIT project.

I'll present recent results on the following studies:
1. A Query-by-Example Content-Based Image Retrieval System of Non-Melanoma Skin Lesions
In this part I'll focus on colour and texture features that have been extracted from skin lesions. I'll also present an evolutionary algorithm for composite feature synthesis.
2. Fuzzy Description of Skin Lesions
In this part I'll talk about a system for describing skin lesion based on a human perception model.
This has been developed by a MSc student.

Barbara Webb, What is associated with what in associative learning?

I will follow up an issue raised in my previous seminar (on non-elemental learning in insects) which is that despite a large number of behavioural experiments, neuroscientific investigations and computational models, there are a number of gaps and inconsistencies in accounts of associative learning which are hard to resolve. I'll illustrate with some supposedly 'simple' examples of learning in insects that we are trying to model. Depending on time, I will also use this issue to illustrate some of the general methodological pitfalls of 'agent' or 'animat' modelling (as discussed in my recent Adaptive Behaviour (vol 17 no 4) article "Animals vs. Animats").

Ricardo Gutierrez-Osuna, A system-wide model of the olfactory pathway for chemosensor arrays

In this talk, I will describe a computational model for chemical sensor arrays inspired by information processing in the olfactory system. First, I will present a model of sensory convergence that leads to spatial representations consistent with those observed in the olfactory bulb. Next, I will describe models of lateral inhibition in the olfactory bulb that provide concentration normalization and contrast enhancement of odor patterns. Finally, I will propose a model of bulb-cortex interactions that can be used to perform odor segmentation and background suppression. Our models are validated on experimental data from temperature-modulated metal-oxide sensor, optical microbead arrays, and infrared absorption spectroscopy. I will conclude with a brief discussion of our current work on active sensing with Partially Observable Markov Decision Processes.