Author: "Edmond S. L. Ho" / Topic: computer vision - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Edmond S. L. Ho"' showing total 27 results

Start Over Author "Edmond S. L. Ho" Topic computer vision

27 results on '"Edmond S. L. Ho"'

1. Spatio-Temporal Manifold Learning for Human Motions via Long-Horizon Modeling

Author: Zhanxing Zhu, Hubert P. H. Shum, He Wang, and Edmond S. L. Ho
Subjects: FOS: Computer and information sciences, Computer Science - Machine Learning, Computer science, Movement, Feature extraction, Video Recording, ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION, Motion (geometry), 02 engineering and technology, Machine Learning (cs.LG), Computer graphics, Computer Science - Graphics, Deep Learning, Computer Graphics, Image Processing, Computer-Assisted, 0202 electrical engineering, electronic engineering, information engineering, Humans, Computer vision, Visual artifact, Computer animation, G500, business.industry, G400, Deep learning, Frame (networking), Nonlinear dimensionality reduction, 020207 software engineering, Animation, Computer Graphics and Computer-Aided Design, Graphics (cs.GR), Signal Processing, Computer Vision and Pattern Recognition, Artificial intelligence, business, Software
Abstract: Data-driven modeling of human motions is ubiquitous in computer graphics and computer vision applications, such as synthesizing realistic motions or recognizing actions. Recent research has shown that such problems can be approached by learning a natural motion manifold using deep learning to address the shortcomings of traditional data-driven approaches. However, previous methods can be sub-optimal for two reasons. First, the skeletal information has not been fully utilized for feature extraction. Unlike images, it is difficult to define spatial proximity in skeletal motions in the way that deep networks can be applied. Second, motion is time-series data with strong multi-modal temporal correlations. A frame could be followed by several candidate frames leading to different motions; long-range dependencies exist where a number of frames in the beginning correlate to a number of frames later. Ineffective modeling would either under-estimate the multi-modality and variance, resulting in featureless mean motion or over-estimate them resulting in jittery motions. In this paper, we propose a new deep network to tackle these challenges by creating a natural motion manifold that is versatile for many applications. The network has a new spatial component for feature extraction. It is also equipped with a new batch prediction model that predicts a large number of frames at once, such that long-term temporally-based objective functions can be employed to correctly learn the motion multi-modality and variances. With our system, long-duration motions can be predicted/synthesized using an open-loop setup where the motion retains the dynamics accurately. It can also be used for denoising corrupted motions and synthesizing new motions with given control signals. We demonstrate that our system can create superior results comparing to existing work in multiple applications., 12 pages, Accepted in IEEE Transaction on Visualization and Computer Graphics
Published: 2021
Full Text: View/download PDF

2. Image editing-based data augmentation for illumination-insensitive background subtraction

Author: Garry Elvin, Dimitrios Sakkos, Hubert P. H. Shum, and Edmond S. L. Ho
Subjects: Background subtraction, Computer science, business.industry, G400, ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION, General Decision Sciences, 020207 software engineering, 02 engineering and technology, Image editing, computer.software_genre, Convolutional neural network, Management of Technology and Innovation, 0202 electrical engineering, electronic engineering, information engineering, 020201 artificial intelligence & image processing, Computer vision, Artificial intelligence, business, computer, Information Systems
Abstract: PurposeA core challenge in background subtraction (BGS) is handling videos with sudden illumination changes in consecutive frames. In our pilot study published in, Sakkos:SKIMA 2019, we tackle the problem from a data point-of-view using data augmentation. Our method performs data augmentation that not only creates endless data on the fly but also features semantic transformations of illumination which enhance the generalisation of the model.Design/methodology/approachIn our pilot study published in SKIMA 2019, the proposed framework successfully simulates flashes and shadows by applying the Euclidean distance transform over a binary mask generated randomly. In this paper, we further enhance the data augmentation framework by proposing new variations in image appearance both locally and globally.FindingsExperimental results demonstrate the contribution of the synthetics in the ability of the models to perform BGS even when significant illumination changes take place.Originality/valueSuch data augmentation allows us to effectively train an illumination-invariant deep learning model for BGS. We further propose a post-processing method that removes noise from the output binary map of segmentation, resulting in a cleaner, more accurate segmentation map that can generalise to multiple scenes of different conditions. We show that it is possible to train deep learning models even with very limited training samples. The source code of the project is made publicly available at https://github.com/dksakkos/illumination_augmentation
Published: 2020
Full Text: View/download PDF

3. 3DBodyNet: Fast Reconstruction of 3D Animatable Human Body Shape from a Single Commodity Depth Camera

Author: Pengpeng Hu, Adrian Munteanu, Edmond S. L. Ho, Multidimensional signal processing and communication, Faculty of Engineering, and Electronics and Informatics
Subjects: Operability, Computer science, business.industry, deep learning on point clouds, Deep learning, Process (computing), G900, 3d scanning, Human body shape, Computer Science Applications, Human-body model, Running time, Mobile phone, Body shape under clothing, Signal Processing, Media Technology, Computer vision, Artificial intelligence, 3D scanning, Electrical and Electronic Engineering, depth camera, Whole body, business
Abstract: Knowledge about individual body shape has numerous applications in various domains such as healthcare, fashion and personalized entertainment. Most of the depth based whole body scanners need multiple cameras surrounding the user and requiring the user to keep a canonical pose strictly during capturing depth images. These scanning devices are expensive and need professional knowledge for operation. In order to make 3D scanning as easy-to-use and fast as possible, there is a great demand to simplify the process and to reduce the hardware requirements. In this paper, we propose a deep learning algorithm,\ud dubbed 3DBodyNet, to rapidly reconstruct the 3D shape of human bodies using a single commodity depth camera. As easy-to-use as taking a photo using a mobile phone, our algorithm only needs two depth images of the front-facing and back-facing bodies. The proposed algorithm has strong operability since it is insensitive to the pose and the pose variations between the two depth images. It can also reconstruct an accurate body shape for users under tight/loose clothing. Another advantage of our method is the ability to generate an animatable human body model. Extensive experimental results show that the proposed method enables robust and easy-to-use animatable human body\ud reconstruction, and outperforms the state-of-the-art methods with respect to running time and accuracy.
Published: 2021

4. Emotion Transfer for 3D Hand and Full Body Motion using StarGAN

Author: Jacky C. P. Chan and Edmond S. L. Ho
Subjects: Computer Networks and Communications, Computer science, ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION, emotion, hand animation, 02 engineering and technology, Translation (geometry), Motion capture, Motion (physics), lcsh:QA75.5-76.95, Task (project management), Naturalness, skeletal motion, Transfer (computing), body motion, 0202 electrical engineering, electronic engineering, information engineering, motion capture, Computer vision, Sequence, business.industry, G400, style transfer, generative adversarial network, 020207 software engineering, Motion synthesis, Human-Computer Interaction, user study, 020201 artificial intelligence & image processing, Artificial intelligence, lcsh:Electronic computers. Computer science, business
Abstract: In this paper, we propose a new data-driven framework for 3D hand and full-body motion emotion transfer. Specifically, we formulate the motion synthesis task as an image-to-image translation problem. By presenting a motion sequence as an image representation, the emotion can be transferred by our framework using StarGAN. To evaluate our proposed method’s effectiveness, we first conducted a user study to validate the perceived emotion from the captured and synthesized hand motions. We further evaluate the synthesized hand and full body motions qualitatively and quantitatively. Experimental results show that our synthesized motions are comparable to the captured motions and those created by an existing method in terms of naturalness and visual quality.
Published: 2021

5. Makeup Style Transfer on Low-quality Images with Weighted Multi-scale Attention

Author: Hubert P. H. Shum, Daniel Organisciak, and Edmond S. L. Ho
Subjects: Pixel, Channel (digital image), Computer science, business.industry, G400, Feature extraction, ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION, 02 engineering and technology, Image segmentation, 010501 environmental sciences, 01 natural sciences, Facial recognition system, Kernel (image processing), Face (geometry), 0202 electrical engineering, electronic engineering, information engineering, 020201 artificial intelligence & image processing, Computer vision, Artificial intelligence, Scale (map), business, 0105 earth and related environmental sciences
Abstract: Facial makeup style transfer is an extremely challenging sub-field of image-to-image-translation. Due to this difficulty, state-of-the-art results are mostly reliant on the Face Parsing Algorithm, which segments a face into parts in order to easily extract makeup features. However, this algorithm can only work well on high-definition images where facial features can be accurately extracted. Faces in many real-world photos, such as those including a large background or multiple people, are typically of low-resolution, which considerably hinders state-of-the-art algorithms. In this paper, we propose an end-to-end holistic approach to effectively transfer makeup styles between two low-resolution images. The idea is built upon a novel weighted multi-scale spatial attention module, which identifies salient pixel regions on low-resolution images in multiple scales, and uses channel attention to determine the most effective attention map. This design provides two benefits: low-resolution images are usually blurry to different extents, so a multi-scale architecture can select the most effective convolution kernel size to implement spatial attention; makeup is applied on both a macro-level (foundation, fake tan) and a micro-level (eyeliner, lipstick) so different scales can excel in extracting different makeup features. We develop an Augmented CycleGAN network that embeds our attention modules at selected layers to most effectively transfer makeup. Our system is tested with the FBD data set, which consists of many low-resolution facial images, and demonstrate that it outperforms state-of-the-art methods, particularly in transferring makeup for blurry images and partially occluded images.
Published: 2021
Full Text: View/download PDF

6. Assessing Facial Symmetry and Attractiveness using Augmented Reality

Author: Robertas Damaševičius, Anna Esposito, Rytis Maskeliūnas, Kevin D. McCay, Wei Wei, Edmond S. L. Ho, Wei, W., Ho, E. S. L., Mccay, K. D., Damasevicius, R., Maskeliunas, R., and Esposito, A.
Subjects: Attractiveness, business.industry, Computer science, media_common.quotation_subject, G400, Mobile app, ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION, Augmented reality, G600, Artificial Intelligence, Perception, Face (geometry), Pattern recognition (psychology), Benchmark (computing), Computer vision, Computer Vision and Pattern Recognition, Artificial intelligence, business, Set (psychology), Facial analysi, Facial symmetry, media_common, ComputingMethodologies_COMPUTERGRAPHICS
Abstract: Facial symmetry is a key component in quantifying the perception of beauty. In this paper, we propose a set of facial features computed from facial landmarks which can be extracted at a low computational cost. We quantitatively evaluated the proposed features for predicting perceived attractiveness from human portraits on four benchmark datasets (SCUT-FBP, SCUT-FBP5500, FACES and Chicago Face Database). Experimental results showed that the performance of the proposed features is comparable to those extracted from a set with much denser facial landmarks. The computation of facial features was also implemented as an augmented reality (AR) app developed on Android OS. The app overlays four types of measurements and guidelines over a live video stream, while the facial measurements are computed from the tracked facial landmarks at run time. The developed app can be used to assist plastic surgeons in assessing facial symmetry when planning reconstructive facial surgeries.
Published: 2021

7. Illumination-Aware Multi-Task GANs for Foreground Segmentation

Author: Dimitrios Sakkos, Edmond S. L. Ho, and Hubert P. H. Shum
Subjects: Background subtraction, General Computer Science, Computer science, business.industry, G400, video segmentation, ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION, General Engineering, multi-task learning, Multi-task learning, 020207 software engineering, 02 engineering and technology, Robustness (computer science), 0202 electrical engineering, electronic engineering, information engineering, 020201 artificial intelligence & image processing, General Materials Science, Segmentation, Computer vision, lcsh:Electrical engineering. Electronics. Nuclear engineering, Artificial intelligence, generative adversarial networks, illumination-aware, business, lcsh:TK1-9971
Abstract: Foreground-background segmentation has been an active research area over the years. However, conventional models fail to produce accurate results when challenged with the videos of challenging illumination conditions. In this paper, we present a robust model that allows accurately extracting the foreground even in exceptionally dark or bright scenes and in continuously varying illumination in a video sequence. This is accomplished by a triple multi-task generative adversarial network (TMT-GAN) that effectively models the semantic relationship between the dark and bright images and performs binary segmentation end-to-end. Our contribution is twofold: first, we show that by jointly optimizing the GAN loss and the segmentation loss, our network simultaneously learns both tasks that mutually benefit each other. Second, fusing features of images with varying illumination into the segmentation branch vastly improve the performance of the network. Comparative evaluations on highly challenging real and synthetic benchmark datasets (ESI and SABS) demonstrate the robustness of TMT-GAN and its superiority over state-of-the-art approaches.
Published: 2019
Full Text: View/download PDF

8. 3D Object Reconstruction from Imperfect Depth Data Using Extended YOLOv3 Network

Author: Robertas Damaševičius, Audrius Kulikajevas, Rytis Maskeliūnas, Edmond S. L. Ho, and MDPI AG (Basel, Switzerland)
Subjects: Data stream, Computer science, Point cloud, imperfect data, 02 engineering and technology, lcsh:Chemical technology, 01 natural sciences, Biochemistry, Article, Analytical Chemistry, 0202 electrical engineering, electronic engineering, information engineering, lcsh:TP1-1185, Computer vision, Segmentation, 3D scanning, Electrical and Electronic Engineering, Instrumentation, hybrid neural networks, RGB-D sensors, G500, business.industry, G400, 010401 analytical chemistry, Process (computing), artificial intelligence, Object (computer science), 3D shape reconstruction, Atomic and Molecular Physics, and Optics, 0104 chemical sciences, 020201 artificial intelligence & image processing, Augmented reality, Noise (video), Artificial intelligence, business
Abstract: State-of-the-art intelligent versatile applications provoke the usage of full 3D, depth-based streams, especially in the scenarios of intelligent remote control and communications, where virtual and augmented reality will soon become outdated and are forecasted to be replaced by point cloud streams providing explorable 3D environments of communication and industrial data. One of the most novel approaches employed in modern object reconstruction methods is to use a priori knowledge of the objects that are being reconstructed. Our approach is different as we strive to reconstruct a 3D object within much more difficult scenarios of limited data availability. Data stream is often limited by insufficient depth camera coverage and, as a result, the objects are occluded and data is lost. Our proposed hybrid artificial neural network modifications have improved the reconstruction results by 8.53% which allows us for much more precise filling of occluded object sides and reduction of noise during the process. Furthermore, the addition of object segmentation masks and the individual object instance classification is a leap forward towards a general-purpose scene reconstruction as opposed to a single object reconstruction task due to the ability to mask out overlapping object instances and using only masked object area in the reconstruction process.
Published: 2020

9. 3D Car Shape Reconstruction from a Single Sketch Image

Author: Naoki Nozawa, Hubert P. H. Shum, Edmond S. L. Ho, Shigeo Morishima, Shum, Hubert, Ho, Edmond, Cani, Marie-Paule, Popa, Tiberiu, Holden, Daniel, and Wang, He
Subjects: Artificial neural network, Computer science, business.industry, G400, Deep learning, 3D reconstruction, Autoencoder, Sketch, Lazy learning, Polygon mesh, Computer vision, Artificial intelligence, business, Computer animation, Subspace topology
Abstract: Efficient car shape design is a challenging problem in both the automotive industry and the computer animation/games industry. In this paper, we present a system to reconstruct the 3D car shape from a single 2D sketch image. To learn the correlation between 2D sketches and 3D cars, we propose a Variational Autoencoder deep neural network that takes a 2D sketch and generates a set of multi-view depth & mask images, which are more effective representation comparing to 3D mesh, and can be combined to form the 3D car shape. To ensure the volume and diversity of the training data, we propose a feature-preserving car mesh augmentation pipeline for data augmentation. Since deep learning has limited capacity to reconstruct fine-detail features, we propose a lazy learning approach that constructs a small subspace based on a few relevant car samples in the database. Due to the small size of such a subspace, fine details can be represented effectively with a small number of parameters. With a low-cost optimization process, a high-quality car with detailed features is created. Experimental results show that the system performs consistently to create highly realistic cars of substantially different shape and topology, with a very low computational cost.
Published: 2019
Full Text: View/download PDF

10. Prior-less 3D Human Shape Reconstruction with an Earth Mover’s Distance Informed CNN

Author: Jingtian Zhang, Edmond S. L. Ho, Kevin D. McCay, Hubert P. H. Shum, Shum, Hubert, Ho, Edmond, Cani, Marie-Paule, Popa, Tiberiu, Holden, Daniel, and Wang, He
Subjects: Ground truth, Computer science, business.industry, Deep learning, G400, 010401 analytical chemistry, Point cloud, ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION, 020207 software engineering, 02 engineering and technology, Function (mathematics), 01 natural sciences, 0104 chemical sciences, Image (mathematics), Parametric model, 0202 electrical engineering, electronic engineering, information engineering, Computer vision, Artificial intelligence, Representation (mathematics), business, Earth mover's distance
Abstract: We propose a novel end-to-end deep learning framework, capable of 3D human shape reconstruction from a 2D image without the need of a 3D prior parametric model. We employ a “prior-less” representation of the human shape using unordered point clouds. Due to the lack of prior information, comparing the generated and ground truth point clouds to evaluate the reconstruction error is challenging. We solve this problem by proposing an Earth Mover’s Distance (EMD) function to find the optimal mapping between point clouds. Our experimental results show that we are able to obtain a visually accurate estimation of the 3D human shape from a single 2D image, with some inaccuracy for heavily occluded parts.
Published: 2019

11. Illumination-Based Data Augmentation for Robust Background Subtraction

Author: Hubert P. H. Shum, Edmond S. L. Ho, and Dimitrios Sakkos
Subjects: FOS: Computer and information sciences, Background subtraction, On the fly, business.industry, Computer science, G400, Computer Vision and Pattern Recognition (cs.CV), Deep learning, Euclidean distance transform, Computer Science - Computer Vision and Pattern Recognition, Binary number, Convolutional neural network, Graphics (cs.GR), Computer Science - Graphics, Core (graph theory), Computer vision, Artificial intelligence, business
Abstract: A core challenge in background subtraction (BGS) is handling videos with sudden illumination changes in consecutive frames. In this paper, we tackle the problem from a data point-of-view using data augmentation. Our method performs data augmentation that not only creates endless data on the fly, but also features semantic transformations of illumination which enhance the generalisation of the model. It successfully simulates flashes and shadows by applying the Euclidean distance transform over a binary mask that is randomly generated. Such data allows us to effectively train an illumination-invariant deep learning model for BGS. Experimental results demonstrate the contribution of the synthetics in the ability of the models to perform BGS even when significant illumination changes take place. The source code of the project is made publicly available at https://github.com/dksakkos/illumination_augmentation., SKIMA 2019 - Best Paper Award
Published: 2019
Full Text: View/download PDF

12. Unsupervised abnormal behaviour detection with overhead crowd video

Author: Hubert P. H. Shum, Nauman Aslam, Shoujiang Xu, and Edmond S. L. Ho
Subjects: Training set, Computer science, business.industry, Feature extraction, Fully automatic, Trajectory, Training (meteorology), Overhead (computing), RGB color model, Computer vision, Artificial intelligence, Cluster analysis, business
Abstract: Due to the increasing threat of terrorism, it has become more and more important to detect abnormal behaviour in public areas. In this paper, we introduce a system to identify pedestrians with abnormal movement trajectories in a scene using a data-driven approach. Our system includes two parts. The first part is an interactive tool that takes an overhead video as an input and tracks the pedestrians in a semi-automatic manner. The second part is a data-driven abnormal trajectories detection algorithm, which applies iterative k-means clustering to find out possible paths in the scene and thereby identifies those that do not fit well in any paths. Since the system requires only RGB video, it is compatible with most of the closed-circuit television (CCTV) systems used for security monitoring. Furthermore, the training of the abnormal trajectories detection algorithm is unsupervised and fully automatic. It means that the system can be deployed into a new location without manual parameter tuning and training data annotations. The system can be applied in indoor and outdoor environments and is best for automatic security monitoring.
Published: 2017
Full Text: View/download PDF

13. Topology Aware Data-Driven Inverse Kinematics

Author: Yiu-ming Cheung, Pong C. Yuen, Edmond S. L. Ho, and Hubert P. H. Shum
Subjects: Inverse kinematics, business.industry, Computer science, Animation, Kinematics, Topology, Computer Graphics and Computer-Aided Design, Computer graphics, Real-time computer graphics, Computer vision, Artificial intelligence, business, Focus (optics), Computer facial animation, Simulation, Computer animation
Abstract: Creating realistic human movement is a time consuming and labour intensive task. The major difficulty is that the user has toedit individual joints while maintaining an overallrealistic and collisionfree posture.Previous research suggests the use of data-driven inverse kinematics, such that one can focus on the control of a few joints, while the system automatically composes a natural posture. However, as a common problem of kinematics synthesis, penetration of body parts is difficult to avoid in complex movements. In this paper, we propose a new data-driven inverse kinematics framework that conserves the topology of the synthesizing postures. Our system monitors and regulates the topology changes using the Gauss Linking Integral (GLI), such that penetration can be efficiently prevented. As a result, complex motions with tight body movements, as well as those involving interaction with external objects, can be simulated with minimal manual intervention. Experimental results show that using our system, the user can create high quality human motion in real-time by controlling a few joints using a mouse or a multi-touch screen. The movement generated is both realistic and penetration free. Our system is best applied for interactive motion design in computer animations and games.
Published: 2013
Full Text: View/download PDF

14. Real-Time Posture Reconstruction for Microsoft Kinect

Author: Hubert P. H. Shum, Yang Jiang, Edmond S. L. Ho, and Shu Takagi
Subjects: Computer science, Posture, Transducers, ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION, Kinematics, Virtual reality, Motion capture, Motion (physics), Pattern Recognition, Automated, Imaging, Three-Dimensional, Artificial Intelligence, Computer Systems, Humans, Computer Simulation, Whole Body Imaging, Computer vision, Electrical and Electronic Engineering, Visual artifact, Computer Peripherals, Pose, ComputingMethodologies_COMPUTERGRAPHICS, business.industry, Usability, Image Enhancement, Actigraphy, Computer Science Applications, Human-Computer Interaction, Video Games, Control and Systems Engineering, Pattern recognition (psychology), Artificial intelligence, business, Algorithms, Software, Information Systems
Abstract: The recent advancement of motion recognition using Microsoft Kinect stimulates many new ideas in motion capture and virtual reality applications. Utilizing a pattern recognition algorithm, Kinect can determine the positions of different body parts from the user. However, due to the use of a single-depth camera, recognition accuracy drops significantly when the parts are occluded. This hugely limits the usability of applications that involve interaction with external objects, such as sport training or exercising systems. The problem becomes more critical when Kinect incorrectly perceives body parts. This is because applications have limited information about the recognition correctness, and using those parts to synthesize body postures would result in serious visual artifacts. In this paper, we propose a new method to reconstruct valid movement from incomplete and noisy postures captured by Kinect. We first design a set of measurements that objectively evaluates the degree of reliability on each tracked body part. By incorporating the reliability estimation into a motion database query during run time, we obtain a set of similar postures that are kinematically valid. These postures are used to construct a latent space, which is known as the natural posture space in our system, with local principle component analysis. We finally apply frame-based optimization in the space to synthesize a new posture that closely resembles the true user posture while satisfying kinematic constraints. Experimental results show that our method can significantly improve the quality of the recognized posture under severely occluded environments, such as a person exercising with a basketball or moving in a small room.
Published: 2013
Full Text: View/download PDF

15. Interactive partner control in close interactions for real-time applications

Author: Howard Leung, Edmond S. L. Ho, Taku Komura, and Jacky C. P. Chan
Subjects: 0209 industrial biotechnology, Computer Networks and Communications, Computer science, business.industry, 020207 software engineering, 02 engineering and technology, Motion capture, Motion (physics), Variety (cybernetics), 020901 industrial engineering & automation, Character (mathematics), Hardware and Architecture, Human–computer interaction, 0202 electrical engineering, electronic engineering, information engineering, Character animation, Computer vision, Artificial intelligence, Adaptation (computer science), business, Representation (mathematics), Control (linguistics)
Abstract: This article presents a new framework for synthesizing motion of a virtual character in response to the actions performed by a user-controlled character in real time. In particular, the proposed method can handle scenes in which the characters are closely interacting with each other such as those in partner dancing and fighting. In such interactions, coordinating the virtual characters with the human player automatically is extremely difficult because the system has to predict the intention of the player character. In addition, the style variations from different users affect the accuracy in recognizing the movements of the player character when determining the responses of the virtual character. To solve these problems, our framework makes use of the spatial relationship-based representation of the body parts called interaction mesh, which has been proven effective for motion adaptation. The method is computationally efficient, enabling real-time character control for interactive applications. We demonstrate its effectiveness and versatility in synthesizing a wide variety of motions with close interactions.
Published: 2013
Full Text: View/download PDF

16. SkillVis: a visualization tool for boxing skill assessment

Author: Hubert P. H. Shum, He Wang, Edmond S. L. Ho, and Taku Komura
Subjects: Motion analysis, business.industry, Computer science, Dimensionality reduction, Sports science, G400, 020207 software engineering, 02 engineering and technology, Motion capture, Visualization, Information visualization, 3d space, Human–computer interaction, 0202 electrical engineering, electronic engineering, information engineering, Graph (abstract data type), 020201 artificial intelligence & image processing, Computer vision, Artificial intelligence, business
Abstract: Motion analysis and visualization are crucial in sports science for sports training and performance evaluation. While primitive computational methods have been proposed for simple analysis such as postures and movements, few can evaluate the high-level quality of sports players such as their skill levels and strategies. We propose a visualization tool to help visualizing boxers' motions and assess their skill levels. Our system automatically builds a graph-based representation from motion capture data and reduces the dimension of the graph onto a 3D space so that it can be easily visualized and understood. In particular, our system allows easy understanding of the boxer's boxing behaviours, preferred actions, potential strength and weakness. We demonstrate the effectiveness of our system on different boxers' motions. Our system not only serves as a tool for visualization, it also provides intuitive motion analysis that can be further used beyond sports science.
Published: 2016
Full Text: View/download PDF

17. Spatial relationship preserving character motion adaptation

Author: Chiew-Lan Tai, Edmond S. L. Ho, and Taku Komura
Subjects: Semantics (computer science), business.industry, Computer science, Animation, Computer Graphics and Computer-Aided Design, Motion (physics), Character (mathematics), Retargeting, Character animation, Polygon mesh, Computer vision, Artificial intelligence, business, Representation (mathematics), ComputingMethodologies_COMPUTERGRAPHICS
Abstract: This paper presents a new method for editing and retargeting motions that involve close interactions between body parts of single or multiple articulated characters, such as dancing, wrestling, and sword fighting, or between characters and a restricted environment, such as getting into a car. In such motions, the implicit spatial relationships between body parts/objects are important for capturing the scene semantics. We introduce a simple structure called an interaction mesh to represent such spatial relationships. By minimizing the local deformation of the interaction meshes of animation frames, such relationships are preserved during motion editing while reducing the number of inappropriate interpenetrations. The interaction mesh representation is general and applicable to various kinds of close interactions. It also works well for interactions involving contacts and tangles as well as those without any contacts. The method is computationally efficient, allowing real-time character control. We demonstrate its effectiveness and versatility in synthesizing a wide variety of motions with close interactions.
Published: 2010
Full Text: View/download PDF

18. Indexing and Retrieving Motions of Characters in Close Contact

Author: Edmond S. L. Ho and Taku Komura
Subjects: Similarity (geometry), Abstracting and Indexing, Computer science, ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION, Information Storage and Retrieval, Models, Biological, Sensitivity and Specificity, Pattern Recognition, Automated, Computer graphics, Imaging, Three-Dimensional, Image Interpretation, Computer-Assisted, Computer Graphics, Humans, Computer Simulation, Computer vision, Computer animation, ComputingMethodologies_COMPUTERGRAPHICS, business.industry, Search engine indexing, Reproducibility of Results, Numerical Analysis, Computer-Assisted, Animation, Image Enhancement, Computer Graphics and Computer-Aided Design, Euclidean distance, Signal Processing, Character animation, Graph (abstract data type), Computer Vision and Pattern Recognition, Artificial intelligence, business, Algorithms, Software
Abstract: Human motion indexing and retrieval are important for animators due to the need to search for motions in the database which can be blended and concatenated. Most of the previous researches of human motion indexing and retrieval compute the Euclidean distance of joint angles or joint positions. Such approaches are difficult to apply for cases in which multiple characters are closely interacting with each other, as the relationships of the characters are not encoded in the representation. In this research, we propose a topology-based approach to index the motions of two human characters in close contact. We compute and encode how the two bodies are tangled based on the concept of rational tangles. The encoded relationships, which we define as TangleList, are used to determine the similarity of the pairs of postures. Using our method, we can index and retrieve motions such as one person piggy-backing another, one person assisting another in walking, and two persons dancing / wrestling. Our method is useful to manage a motion database of multiple characters. We can also produce motion graph structures of two characters closely interacting with each other by interpolating and concatenating topologically similar postures and motion clips, which are applicable to 3D computer games and computer animation.
Published: 2009
Full Text: View/download PDF

19. Improving posture classification accuracy for depth sensor-based human activity monitoring in smart environments

Author: Pong C. Yuen, Yiu-ming Cheung, Donald C.K. Chan, Jacky C. P. Chan, Hubert P. H. Shum, and Edmond S. L. Ho
Subjects: Computer science, Posture classification, 02 engineering and technology, Workspace, Machine learning, computer.software_genre, Bottleneck, Max-margin classification, Robustness (computer science), 0202 electrical engineering, electronic engineering, information engineering, Depth camera, Computer vision, Smart environments, Monitoring systems, business.industry, Reliability estimation, G400, 020207 software engineering, Monitoring system, Body movement, Software deployment, Signal Processing, 020201 artificial intelligence & image processing, Smart environment, Artificial intelligence, Computer Vision and Pattern Recognition, business, computer, Classifier (UML), Software
Abstract: A new posture classification framework for Kinect is proposed.Accuracy in classifying noisy postures is improved by considering the reliability of each joint.Reliability of a joint can be evaluated by the consistency in different aspects over time.Performance of classifier is improved by learning the weights of reliability terms. Smart environments and monitoring systems are popular research areas nowadays due to its potential to enhance the quality of life. Applications such as human behavior analysis and workspace ergonomics monitoring are automated, thereby improving well-being of individuals with minimal running cost. The central problem of smart environments is to understand what the user is doing in order to provide the appropriate support. While it is difficult to obtain information of full body movement in the past, depth camera based motion sensing technology such as Kinect has made it possible to obtain 3D posture without complex setup. This has fused a large number of research projects to apply Kinect in smart environments. The common bottleneck of these researches is the high amount of errors in the detected joint positions, which would result in inaccurate analysis and false alarms. In this paper, we propose a framework that accurately classifies the nature of the 3D postures obtained by Kinect using a max-margin classifier. Different from previous work in the area, we integrate the information about the reliability of the tracked joints in order to enhance the accuracy and robustness of our framework. As a result, apart from general classifying activity of different movement context, our proposed method can classify the subtle differences between correctly performed and incorrectly performed movement in the same context. We demonstrate how our framework can be applied to evaluate the user's posture and identify the postures that may result in musculoskeletal disorders. Such a system can be used in workplace such as offices and factories to reduce risk of injury. Experimental results have shown that our method consistently outperforms existing algorithms in both activity classification and posture healthiness classification. Due to the low cost and the easy deployment process of depth camera based motion sensors, our framework can be applied widely in home and office to facilitate smart environments.
Published: 2016

20. Modeling Spatial Relations of Human Body Parts for Indexing and Retrieving Close Character Interactions

Author: Jacky C. P. Chan, Yiu-ming Cheung, Pong C. Yuen, and Edmond S. L. Ho
Subjects: Computer science, business.industry, G400, Search engine indexing, Human body, Virtual reality, Motion (physics), Computer graphics, Spatial relation, Pairwise comparison, Computer vision, Artificial intelligence, Representation (mathematics), business
Abstract: Retrieving pre-captured human motion for analyzing and synthesizing virtual character movement have been widely used in Virtual Reality (VR) and interactive computer graphics applications. In this paper, we propose a new human pose representation, called Spatial Relations of Human Body Parts (SRBP), to represent spatial relations between body parts of the subject(s), which intuitively describes how much the body parts are interacting with each other. Since SRBP is computed from the local structure (i.e. multiple body parts in proximity) of the pose instead of the information from individual or pairwise joints as in previous approaches, the new representation is robust to minor variations of individual joint location. Experimental results show that SRBP outperforms the existing skeleton-based motion retrieval and classification approaches on benchmark databases.
Published: 2015

21. An Energy-Driven Motion Planning Method for Two Distant Postures

Author: Edmond S. L. Ho, Taku Komura, and He Wang
Subjects: Computer science, business.industry, G400, ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION, Robotics, Graph theory, Animation, Computer Graphics and Computer-Aided Design, Computer Science::Robotics, Motion field, Motion estimation, Signal Processing, Character animation, Graph (abstract data type), Computer vision, Computer Vision and Pattern Recognition, Artificial intelligence, Motion planning, Configuration space, business, Software, Computer animation, ComputingMethodologies_COMPUTERGRAPHICS
Abstract: In this paper, we present a local motion planning algorithm for character animation. We focus on motion planning between two distant postures where linear interpolation leads to penetrations. Our framework has two stages. The motion planning problem is first solved as a Boundary Value Problem (BVP) on an energy graph which encodes penetrations, motion smoothness and user control. Having established a mapping from the configuration space to the energy graph, a fast and robust local motion planning algorithm is introduced to solve the BVP to generate motions that could only previously be computed by global planning methods. In the second stage, a projection of the solution motion onto a constraint manifold is proposed for more user control. Our method can be integrated into current keyframing techniques. It also has potential applications in motion planning problems in robotics.
Published: 2015

22. A multi-resolution approach for adapting close character interaction

Author: Edmond S. L. Ho, Taku Komura, and He Wang
Subjects: Optimization problem, Computer science, business.industry, Computation, Motion (physics), Domain (software engineering), Retargeting, Character animation, Computer vision, Artificial intelligence, business, Humanoid robot, Computer animation, ComputingMethodologies_COMPUTERGRAPHICS
Abstract: Synthesizing close interactions such as dancing and fighting between characters is a challenging problem in computer animation. While encouraging results are presented in [Ho et al. 2010], the high computation cost makes the method unsuitable for interactive motion editing and synthesis. In this paper, we propose an efficient multiresolution approach in the temporal domain for editing and adapting close character interactions based on the Interaction Mesh framework. In particular, we divide the original large spacetime optimization problem into multiple smaller problems such that the user can observe the adapted motion while playing-back the movements during run-time. Our approach is highly parallelizable, and achieves high performance by making use of multi-core architectures. The method can be applied to a wide range of applications including motion editing systems for animators and motion retargeting systems for humanoid robots.
Published: 2014
Full Text: View/download PDF

23. Motion adaptation for humanoid robots in constrained environments

Author: Hubert P. H. Shum and Edmond S. L. Ho
Subjects: Rescue robot, Motion compensation, Engineering, business.industry, Motion (physics), Robot control, Computer Science::Robotics, Robot, Computer vision, Artificial intelligence, business, Representation (mathematics), Adaptation (computer science), Humanoid robot
Abstract: This paper presents a new method to synthesize full body motion for controlling humanoid robots in highly constrained environments. Given a reference motion of the robot and the corresponding environment configuration, the spatial relationships between the robot body parts and the environment objects are extracted as a representation called the Interaction Mesh. Such a representation is then used in adapting the reference motion to an altered environment. By preserving the spatial relationships while satisfying physical constraints, collision-free and well balanced motions can be generated automatically and efficiently. Experimental results show that the proposed method can adapt different full body motions in significantly modified environments. Our method can be applied in precise robotic controls under complicated environments, such as rescue robots in accident scenes and searching robots in highly constrained spaces.
Published: 2013
Full Text: View/download PDF

24. Real-time physical modelling of character movements with microsoft kinect

Author: Hubert P. H. Shum and Edmond S. L. Ho
Subjects: business.industry, Computer science, Dynamics (mechanics), ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION, PID controller, Kinematics, Virtual reality, Motion (physics), Computer Science::Robotics, Match moving, Computer graphics (images), Torque, Computer vision, Artificial intelligence, business, ComputingMethodologies_COMPUTERGRAPHICS, Curse of dimensionality
Abstract: With the advancement of motion tracking hardware such as the Microsoft Kinect, synthesizing human-like characters with real-time captured movements becomes increasingly important. Traditional kinematics and dynamics approaches perform sub-optimally when the captured motion is noisy or even incomplete. In this paper, we proposed a unified framework to control physically simulated characters with live captured motion from Kinect. Our framework can synthesize any posture in a physical environment using external forces and torques computed by a PD controller. The major problem of Kinect is the incompleteness of the captured posture, with some degree of freedom (DOF) missing due to occlusions and noises. We propose to search for a best matched posture from a motion database constructed in a dimensionality reduced space, and substitute the missing DOF to the live captured data. Experimental results show that our method can synthesize realistic character movements from noisy captured motion. The proposed algorithm is computationally efficient and can be applied to a wide variety of interactive virtual reality applications such as motion-based gaming, rehabilitation and sport training.
Published: 2012
Full Text: View/download PDF

25. Simulating Interactions of Characters

Author: Hubert P. H. Shum, Edmond S. L. Ho, and Taku Komura
Subjects: business.industry, Computer science, Computer graphics (images), Character animation, Computer vision, Artificial intelligence, Crowd simulation, business, Motion capture
Abstract: It is difficult to create scenes where multiple characters densely interact with each other. Manually creating the motions of characters is time consuming due to the correlation of the movements between the characters. Capturing the motions of multiple characters is also difficult as it requires a huge amount of post-processing of the data. In this paper, we explain the methods we have proposed to simulate close interactions of characters based on singly captured motions. We propose methods to (1) control characters intelligently to cooperatively / competitively interact with the other characters, and (2) generate movements that include close interactions such as tangling the segments with the others by taking into account the topological relationship of the characters.
Published: 2008
Full Text: View/download PDF

26. Planning tangling motions for humanoids

Author: Edmond S. L. Ho and Taku Komura
Subjects: business.industry, Computer science, Mobile robot, Plan (drawing), Robot end effector, Motion (physics), law.invention, Gait (human), law, Robot, Computer vision, Artificial intelligence, Motion planning, business, Humanoid robot
Abstract: There are not much methods for humanoid robots to path-plan movements such as piggyback-carry, firefighter-lift or giving a shoulder to support the gait of another. Such methods are essential to apply humanoid robots for rescuing people in emergency, or for taking care of injured/elderly people in the hospital. It is difficult to plan such motions as they require the limbs of the humanoids to be tangled. In this paper, we propose a new method to plan such movements using a criteria based on the Gauss linking integral (GLI). The user first specifies the target configurations of the humanoids. The system then detects the tangles made between the bodies, and then plans how the humanoids should move to arrive to the target posture from separate configurations. The motion get entangled is planned by the rapidly-exploring random trees (RRT) expansion. Using our method, it is possible to path-plan complex tangling motions which were difficult by previous methods.
Published: 2007
Full Text: View/download PDF

27. Single sketch image based 3D car shape reconstruction with deep learning and lazy learning

Author: Shigeo Morishima, Hubert P. H. Shum, Edmond S. L. Ho, Naoiki Nozawa, Bouatouch, Kadi, Sousa, A. Augusto, and Braz, Jose
Subjects: Computer science, business.industry, G500, Deep learning, G400, 3D reconstruction, H700, Sketch, Lazy learning, Computer vision, Artificial intelligence, Shape reconstruction, business, Image based
Abstract: Efficient car shape design is a challenging problem in both the automotive industry and the computer animation/games industry. In this paper, we present a system to reconstruct the 3D car shape from a single 2D sketchimage. To learn the correlation between 2D sketches and 3D cars, we propose a Variational Autoencoder deepneural network that takes a 2D sketch and generates a set of multi-view depth and mask images, which forma more effective representation comparing to 3D meshes, and can be effectively fused to generate a 3D carshape. Since global models like deep learning have limited capacity to reconstruct fine-detail features, wepropose a local lazy learning approach that constructs a small subspace based on a few relevant car samples inthe database. Due to the small size of such a subspace, fine details can be represented effectively with a smallnumber of parameters. With a low-cost optimization process, a high-quality car shape with detailed featuresis created. Experimental results show that the system performs consistently to create highly realistic cars ofsubstantially different shape and topology.

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

27 results on '"Edmond S. L. Ho"'

1. Spatio-Temporal Manifold Learning for Human Motions via Long-Horizon Modeling

2. Image editing-based data augmentation for illumination-insensitive background subtraction

3. 3DBodyNet: Fast Reconstruction of 3D Animatable Human Body Shape from a Single Commodity Depth Camera

4. Emotion Transfer for 3D Hand and Full Body Motion using StarGAN

5. Makeup Style Transfer on Low-quality Images with Weighted Multi-scale Attention

6. Assessing Facial Symmetry and Attractiveness using Augmented Reality

7. Illumination-Aware Multi-Task GANs for Foreground Segmentation

8. 3D Object Reconstruction from Imperfect Depth Data Using Extended YOLOv3 Network

9. 3D Car Shape Reconstruction from a Single Sketch Image

10. Prior-less 3D Human Shape Reconstruction with an Earth Mover’s Distance Informed CNN

11. Illumination-Based Data Augmentation for Robust Background Subtraction

12. Unsupervised abnormal behaviour detection with overhead crowd video

13. Topology Aware Data-Driven Inverse Kinematics

14. Real-Time Posture Reconstruction for Microsoft Kinect

15. Interactive partner control in close interactions for real-time applications

16. SkillVis: a visualization tool for boxing skill assessment

17. Spatial relationship preserving character motion adaptation

18. Indexing and Retrieving Motions of Characters in Close Contact

19. Improving posture classification accuracy for depth sensor-based human activity monitoring in smart environments

20. Modeling Spatial Relations of Human Body Parts for Indexing and Retrieving Close Character Interactions

21. An Energy-Driven Motion Planning Method for Two Distant Postures

22. A multi-resolution approach for adapting close character interaction

23. Motion adaptation for humanoid robots in constrained environments

24. Real-time physical modelling of character movements with microsoft kinect

25. Simulating Interactions of Characters

26. Planning tangling motions for humanoids

27. Single sketch image based 3D car shape reconstruction with deep learning and lazy learning

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Journal

Database

Publisher

27 results on '"Edmond S. L. Ho"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources