Author: "Lipyeow Lim" / Topic: computer.internet_protocol - Searchworks@Jio Institute Digital Library Search Results

1. Low-latency XPath Query Evaluation on Multi-Core Processors

Author: Benjamin Karsin, Henri Casanova, and Lipyeow Lim
Subjects: Web search query, Information retrieval, Computer science, computer.internet_protocol, XPath 2.0, 02 engineering and technology, Query optimization, Query expansion, XML Schema (W3C), 020204 information systems, 0202 electrical engineering, electronic engineering, information engineering, 020201 artificial intelligence & image processing, Latency (engineering), computer, XML, XPath
Published: 2017
Full Text: View/download PDF

2. Challenges on Modeling Hybrid XML-Relational Databases

Author: Mirella M. Moro, Lipyeow Lim, and Yuan-Chi Chang
Subjects: Information retrieval, computer.internet_protocol, Relational database, Computer science, Redundancy (engineering), Business artifacts, computer, XML
Abstract: It is well known that XML has been widely adopted for its flexible and self-describing nature. However, relational data will continue to co-exist with XML for several different reasons one of which is the high cost of transferring everything to XML. In this context, data designers face the problem of modeling both relational and XML data within an integrated environment. This chapter highlights important questions on hybrid XML-relational database design and discusses use cases, requirements, and deficiencies in existing design methodologies especially in the light of data and schema evolution. The authors’ analysis results in several design guidelines and a series of challenges to be addressed by future research.
Published: 2010
Full Text: View/download PDF

3. Statistics-based parallelization of XPath queries in shared memory systems

Author: Bryant Wei Lun Kok, Anastasios Kementsietsidis, Lipyeow Lim, and Rajesh Bordawekar
Subjects: Theoretical computer science, Exploit, computer.internet_protocol, Computer science, Parallel computing, computer.software_genre, Automatic parallelization, XML database, Simple API for XML, Shared memory, Statistics, Latency (engineering), computer, XML, XPath
Abstract: The wide availability of commodity multi-core systems presents an opportunity to address the latency issues that have plaqued XML query processing. However, simply executing multiple XML queries over multiple cores merely addresses the throughput issue: intra-query parallelization is needed to exploit multiple processing cores for better latency. Toward this effort, this paper investigates the parallelization of individual XPath queries over shared-address space multi-core processors. Much previous work on parallelizing XPath in a distributed setting failed to exploit the shared memory parallelism of multi-core systems. We propose a novel, end-to-end parallelization framework that determines the optimal way of parallelizing an XML query. This decision is based on a statistics-based approach that relies both on the query specifics and the data statistics. At each stage of the parallelization process, we evaluate three alternative approaches, namely, data-, query-, and hybrid-partitioning. For a given XPath query, our parallelization algorithm uses XML statistics to estimate the relative efficiencies of these different alternatives and find an optimal parallel XPath processing plan. Our experiments using well-known XML documents validate our parallel cost model and optimization framework, and demonstrate that it is possible to accelerate XPath processing using commodity multi-core systems.
Published: 2010
Full Text: View/download PDF

4. Parallelization of XPath queries using multi-core processors

Author: Lipyeow Lim, Oded Shmueli, and Rajesh Bordawekar
Subjects: Multi-core processor, computer.internet_protocol, XPath 2.0, Computer science, Programming language, InformationSystems_DATABASEMANAGEMENT, Parallel computing, computer.software_genre, Path expression, Data modeling, Set (abstract data type), XML Schema (W3C), ComputingMethodologies_DOCUMENTANDTEXTPROCESSING, computer, XML, XPath
Abstract: In this study, we present experiences of parallelizing XPath queries using the Xalan XPath engine on shared-address space multi-core systems. For our evaluation, we consider a scenario where an XPath processor uses multiple threads to concurrently navigate and execute individual XPath queries on a shared XML document. Given the constraints of the XML execution and data models, we propose three strategies for parallelizing individual XPath queries: Data partitioning, Query partitioning, and Hybrid (query and data) partitioning. We experimentally evaluated these strategies on an x86 Linux multi-core system using a set of XPath queries, invoked on a variety of XML documents using the Xalan XPath APIs. Experimental results demonstrate that the proposed parallelization strategies work very effectively in practice; for a majority of XPath queries under evaluation, the execution performance scaled linearly as the number of threads was increased. Results also revealed the pros and cons of the different parallelization strategies for different XPath query patterns.
Published: 2009
Full Text: View/download PDF

5. Persisting and querying biometric event streams with hybrid relational-XML DBMS

Author: Lipyeow Lim, Kyu Hyun Kim, Min Wang, and Daby Sow
Subjects: Biometrics, Database, Relational database, business.industry, computer.internet_protocol, Emerging technologies, Computer science, Complex event processing, computer.software_genre, Stream processing, Health care, Leverage (statistics), business, computer, XML
Abstract: Remote monitoring of patients' biometric data streams offers the possibility to physicians to extend and improve their services to chronically ill patients who are away from medical institutions. This emerging technology is a promising way to address important aspects of the cost issues that most health care systems are experiencing. In order to fulfill its potential, several challenges need to be overcome. First, the data collected needs to be filtered and annotated intelligently to help physicians cope with and navigate the large amount of patient sensor data received as a result of large scale remote health monitoring deployments. Secondly, efficient stream persistence and query mechanisms for these data need to be designed to satisfy health care regulations and help physicians track patient health histories accurately and efficiently. In this paper, we concentrate on the second challenge. We leverage emerging hybrid relational-XML database management systems to design a storage sub-system for remote health monitoring. We evaluate this approach by performing series of performance tests to assess the ability of the proposed system to handle the huge amount of biometric data streams requiring persistence.
Published: 2007
Full Text: View/download PDF

6. Schema advisor for hybrid relational-XML DBMS

Author: Lipyeow Lim, Mirella M. Moro, and Yuan-Chi Chang
Subjects: Document Structure Description, XML Encryption, Relational database, computer.internet_protocol, Computer science, Semi-structured model, Efficient XML Interchange, XML Signature, computer.software_genre, Database design, Simple API for XML, Relational database management system, XML Schema Editor, Schema (psychology), Streaming XML, XML schema, computer.programming_language, Database model, Database, Search engine indexing, Database schema, XML validation, computer.file_format, XML framework, XML database, XML Schema (W3C), Data model, Information model, computer, XML
Abstract: In response to the widespread use of the XML format for document representation and message exchange, major database vendors support XML in terms of persistence, querying and indexing. Specifically, the recently released IBM DB2 9 (for Linux, Unix and Windows) is a hybrid data server with optimized management of both XML and relational data. With the new option of storing and querying XML in a relational DBMS, data architects face the the decision of what portion of their data to persist as XML and what portion as relational data. This problem has not been addressed yet and represents a serious need in the industry. Hence, this paper describes ReXSA, a schema advisor tool that is being prototyped for IBM DB2 9. ReXSA proposes candidate database schemas given an information model of the enterprise data. It has the advantage of considering qualitative properties of the information model such as reuse, evolution and performance profiles for deciding how to persist the data. Finally, we show the viability and practicality of ReXSA by applying it to custom and real usecases.
Published: 2007
Full Text: View/download PDF

7. Preserving XML queries during schema evolution

Author: Mirella M. Moro, Susan Malaika, and Lipyeow Lim
Subjects: Document Structure Description, XML Encryption, computer.internet_protocol, Schematron, Computer science, Efficient XML Interchange, Semi-structured model, XML Signature, computer.software_genre, Logical schema, XML Schema Editor, Schema (psychology), Streaming XML, RELAX NG, XML schema, computer.programming_language, Information retrieval, Database schema, InformationSystems_DATABASEMANAGEMENT, XML validation, computer.file_format, Schema evolution, XML database, XML Schema (W3C), Document Schema Definition Languages, Star schema, Document Definition Markup Language, ComputingMethodologies_DOCUMENTANDTEXTPROCESSING, computer, XML
Abstract: In XML databases, new schema versions may be released as frequently as once every two weeks. This poster describes a taxonomy of changes for XML schema evolution. It examines the impact of those changes on schema validation and query evaluation. Based on that study, it proposes guidelines for XML schema evolution and for writing queries in such a way that they continue to operate as expected across evolving schemas.
Published: 2007
Full Text: View/download PDF

8. Semantic Data Management: Towards Querying Data with their Meaning

Author: Haixun Wang, Min Wang, and Lipyeow Lim
Subjects: SQL, Information retrieval, computer.internet_protocol, Relational database, Computer science, InformationSystems_DATABASEMANAGEMENT, Ontology (information science), computer.software_genre, Query language, Data modeling, XQuery, Relational database management system, Ontology, Domain knowledge, computer, XML, computer.programming_language, Semantic matching
Abstract: Relational database management systems are constantly being extended and augmented to accommodate data in different domains. Recently, with the increasing use of ontology in various applications, the need to support ontology, especially the related inferencing operation, in DBMS has become more concrete and urgent. However, manipulating knowledge along with relational data in DBMSs is not a trivial undertaking due to the mismatch in data models. In this paper, we introduce a framework for managing relational data and hierarchical domain knowledge together. Our framework persists taxonomies contained in ontologies by leveraging XML support in hybrid relational-XML DBMSs (e.g., IBM's DB2 v9) and rewrites ontology-based semantic matching queries using the industry-standard query languages, SQL/XML and XQuery. Compared with previous approaches, our approach does not materialize transitive closures of ontological relationships to support inferencing. Consequently, our method has wide applicability and good performance.
Published: 2007
Full Text: View/download PDF

9. Managing e-commerce catalogs in a DBMS with native XML support

Author: M. Wang and Lipyeow Lim
Subjects: Database, business.industry, Relational database, computer.internet_protocol, Computer science, InformationSystems_DATABASEMANAGEMENT, E-commerce, Query optimization, computer.software_genre, Schema evolution, Relational database management system, Sargable, IBM, business, computer, XML
Abstract: Electronic commerce is emerging as a major application area for database systems. A large number of e-commerce stores provide electronic product catalogs that allow customers to search products of interest and store owners to manage various product information. Due to the constant schema evolution and the sparsity of e-commerce data, most commercial e-commerce systems use the so-called vertical schema for data storage. However, query processing for data stored using vertical schema is extremely inefficient because current RDBMSs, especially its cost-based query optimizer, are specifically designed to deal with traditional horizontal schemas. In this paper, we show that e-catalog management can be naturally supported in IBM's system RX, the first DBMS that truly supports both XML and relational data in their native forms. By leveraging on system RX's hybrid nature, we present a novel solution for storing, managing, and querying e-catalog data. In addition to traditional queries, we show that our solution supports semantic queries as well. Our solution does not require a separate query optimization layer, because query optimization is handled within the hybrid DBMS engine itself
Published: 2005
Full Text: View/download PDF

10. XPathLearner

Author: Lipyeow Lim, Jeffrey Scott Vitter, Sriram Padmanabhan, Ronald Parr, and Min Wang
Subjects: Information retrieval, Computer science, computer.internet_protocol, Efficient XML Interchange, XML validation, computer.file_format, computer.software_genre, Path expression, XML framework, Simple API for XML, XML Schema (W3C), XML database, Data exchange, Streaming XML, Data mining, computer, XML, XPath
Abstract: The extensible mark-up language (XML) is gaining widespread use as a format for data exchange and storage on the World Wide Web. Queries over XML data require accurate selectivity estimation of path expressions to optimize query execution plans. Selectivity estimation of XML path expression is usually done based on summary statistics about the structure of the underlying XML repository. All previous methods require an off-line scan of the XML repository to collect the statistics. In this paper, we propose XPathLearner, a method for estimating selectivity of the most commonly used types of path expressions without looking at the XML data. XPathLearner gathers and refines the statistics using query feedback in an on-line manner and is especially suited to queries in Internet scale applications since the underlying XML repository is either inaccessible or too large to be scanned in its entirety. Besides the on-line property, our method also has two other novel features: (a) XPathLearner is workload-aware in collecting the statistics and thus can be more accurate than the more costly off-line method under tight memory constraints, and (b) XPathLearner automatically adjusts the statistics using query feedback when the underlying XML data change. We show empirically the estimation accuracy of our method using several real data sets.
Published: 2002
Full Text: View/download PDF

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

10 results on '"Lipyeow Lim"'

1. Low-latency XPath Query Evaluation on Multi-Core Processors

2. Challenges on Modeling Hybrid XML-Relational Databases

3. Statistics-based parallelization of XPath queries in shared memory systems

4. Parallelization of XPath queries using multi-core processors

5. Persisting and querying biometric event streams with hybrid relational-XML DBMS

6. Schema advisor for hybrid relational-XML DBMS

7. Preserving XML queries during schema evolution

8. Semantic Data Management: Towards Querying Data with their Meaning

9. Managing e-commerce catalogs in a DBMS with native XML support

10. XPathLearner

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Journal

Database

Publisher

10 results on '"Lipyeow Lim"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources