Abstract: In recent years, peer-to-peer (P2P) technologies have been used for flexible and scalable information exchange on the Internet, but problems remain to be solved before such exchange is reliable. To enhance the reliability of exchanged information, it is important to trace how data circulates between peers and how it is modified along the way before reaching its destination. However, such lineage tracing is not easy in current P2P networks, since data replication and modification are performed independently by autonomous peers, which undermines the reliability of the records exchanged. In this paper, we propose a framework for traceable record exchange in a P2P network. By managing historical information in distributed peers, we make the modification and exchange histories of records traceable. One feature of our work is that database technologies are used to realize the framework: histories are maintained in a relational database in each peer, and tracing queries are written in the datalog query language and executed in the P2P network by cooperating peers. This paper describes the concept of the framework and gives an overview of the approach to query processing.
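To make the idea of a recursive tracing query over relational history concrete, the following is a minimal sketch, not the framework described in the paper. It assumes a single hypothetical table `exchange(from_peer, to_peer, record_id)` held in one in-memory SQLite database, whereas the abstract describes histories distributed across peers. The table, its columns, and the function `trace_origin` are illustrative names that do not come from the paper. A recursive SQL query stands in for a datalog rule pair.

```python
import sqlite3

# Hypothetical datalog-style reading of the query below (names are assumptions):
#   path(P)  :- P = start_peer.
#   path(Q)  :- path(P), exchange(Q, P, RecordId).

def trace_origin(conn, peer, record_id):
    """Follow a record backwards from `peer` through the peers it came from.

    Assumes the exchange history for a record is acyclic; a cycle would
    make this recursive query run without terminating.
    """
    rows = conn.execute(
        """
        WITH RECURSIVE path(peer, depth) AS (
            SELECT ?, 0
            UNION ALL
            SELECT e.from_peer, path.depth + 1
            FROM exchange AS e
            JOIN path ON e.to_peer = path.peer
            WHERE e.record_id = ?
        )
        SELECT peer FROM path ORDER BY depth
        """,
        (peer, record_id),
    ).fetchall()
    return [row[0] for row in rows]


# Toy history: record r1 went A -> B -> C, record r2 went A -> D.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE exchange (from_peer TEXT, to_peer TEXT, record_id TEXT)"
)
conn.executemany(
    "INSERT INTO exchange VALUES (?, ?, ?)",
    [("A", "B", "r1"), ("B", "C", "r1"), ("A", "D", "r2")],
)

chain = trace_origin(conn, "C", "r1")
print(chain)
```

In a distributed setting each peer would hold only its own part of the history, so the recursive step would have to be forwarded to the peer named in `from_peer` rather than evaluated locally; this sketch does not model that.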
Publication date: 2008-09-05
Cite this article:
LI Fengrong, IIDA Takuya, ISHIKAWA Yoshiharu. Traceable P2P record exchange: a database-oriented approach[J]. Frontiers of Computer Science in China - Selected Publications from Chinese Universities, 0, (): 257-267.
LI Fengrong, IIDA Takuya, ISHIKAWA Yoshiharu. Traceable P2P record exchange: a database-oriented approach. Front. Comput. Sci., 0, (): 257-267.