|
Call for Paper
(PDF)
Important
Dates
Submission Details
Program Committee
Publication
Workshop Program
Venue & Attractions
Hotel &
Travel
DASFAA 2010
|
Poor data
quality is known to compromise the credibility and efficiency of commercial
as well as public endeavours. Several developments from industry as well as
academia have contributed significantly towards addressing the problem.
These typically include analysts and practitioners who have contributed to
the design of strategies and methodologies for data governance; solution
architects including software vendors who have contributed towards
appropriate system architectures that promote data integration and; and
data experts who have contributed to data quality problems such as
duplicate detection, identification of outliers, consistency checking and
many more through the use of computational techniques. The attainment of
true data quality lies at the convergence of the three aspects, namely
organizational, architectural and computational.
At the same
time, importance of managing data quality has increased manifold in today's
global information sharing environments, as the diversity of sources,
formats and volume of data grows. In this workshop we target data quality
in the light of collaborative information systems where data creation and
ownership is increasingly difficult to establish. Collaborative settings
are evident in enterprise systems, where partner/customer data may pollute
enterprise data bases raising the need for data source attribution, as well
as in scientific applications, where data lineage across long running
collaborative scientific processes needs to be established. Collaborative
settings thus warrant a pipeline of data quality methods and techniques
that commence with (source) data assessment, data cleansing, methods for
sustained quality, integration and linkage, and eventually ability for
audit and attribution.
The
workshop will provide a forum to bring together diverse researchers and
make a consolidated contribution to new and extended methods to address the
challenges of data quality in collaborative settings. Topics covered by the
workshop include at least the following:
1.
Data
integration, linkage and fusion
2. Entity resolution, duplicate detection, and consistency checking
3. Data profiling and measurement
4. Use of data mining for data quality assessment
5. Methods for data transformation, reconciliation, consolidation
6. Algorithms for data cleansing
7. Data quality and cleansing in information extraction
8. Dealing with uncertain or noisy data (e.g., sensor data)
9. Data lineage and provenance
10. Models, frameworks, methodologies and metrics for data quality
11. Application specific data quality, case studies, experience reports
12. User/social perceptive on data quality and cleansing
13. Data quality and cleansing for complex data (e.g. documents,
semi-structured data, XMLs, multimedia data,
graphs, biosequences, etc.)
Workshop Program
The
full day workshop will consist of oral presentations, discussions, and
invited talks. The workshop will also provide opportunity for demo
sessions, where presenters can showcase advanced prototypes based on their
research where applicable. More details will come after the paper
submission.
Submission of Papers
Authors
should submit papers reporting original works that are currently not under review
or published elsewhere. The paper should be submitted in PDF format, with
maximum length twelve (12) pages, following Springer-Verlag's
LNCS manuscript submission guidelines, available at http://www.springer.de/comp/lncs/authors.html.
The
submission site of MCIS 2010 is
https://cmt.research.microsoft.com/MCIS2010/
Publication
Proceedings
are planned to be published as a volume of Lecture Notes in Computer
Science (LNCS) from Springer after the workshops. At the workshop site, we
will hand out informal on-site proceedings to participants.
Important Dates
Dec.
11, 2009 Paper submission deadline
Feb. 12, 2010 Acceptance notification to authors
Feb. 26, 2010 On-site paper deadline
Apr. 26, 2010 Final camera-ready copy deadline
Apr. 4, 2010 Workshop
Program Committee
Adam Jatowt,
Kyoto
University, Japan
Bin Wang, Northeastern Univeristy, China
Cheqing Jin, East China Normal University, China
Lei
Chen, Hong Kong University of Science
and Technology
Jiuyong Li, University of South Australia, Australia
Jun Gao, Peking University, China
Qing Liu, CSIRO, Australia
Marek Kowalkiewicz, SAP Australia
Marta Indulska, University of Queensland
Mohamed Medhat Gaber,
Monash University Australia
Wanita Sherchan, CSIRO
Australia
Yanfeng Shu, CSIRO
Australia
Workshop Organizers
Shazia
Sadiq, Xiaofang Zhou, Ke Deng
School of Information Technology and Electrical Engineering
The University of Queensland, Brisbane, Australia
Inquiries:
shazia, dengke,
zxf@itee.uq.edu.au
Xiaochun
Yang
School of Information Science and Engineering,
Northeastern University, China
Inquiries: yangxc@mail.neu.edu.cn
|