\chapter{Abstract}
\label{ch:abs}
% \kat{A French version is needed too :) }
% \mk{All right :( }
Sensors, portable devices, and location-based services generate massive amounts of geo-tagged and/or location- and user-related data on a daily basis.
The manipulation of such data is useful in numerous application domains, e.g.,~healthcare, intelligent buildings, and traffic monitoring.
A high percentage of these data carry information about user activities and other personal details, and thus their manipulation and sharing raise concerns about the privacy of the individuals involved.
To enable data sharing that is secure from the user privacy perspective, researchers have already proposed various seminal techniques for the protection of user privacy.
However, the continuous fashion in which data are generated nowadays and the high availability of external sources of information pose additional threats and add extra challenges to the problem.
% \kat{Mention here the extra challenges posed by the specific problem that you address : the Landmark privacy}
It is therefore essential to design solutions that not only guarantee privacy protection but also provide configurability and account for the preferences of the users.
% Survey
In this thesis, we investigate the literature regarding data privacy in continuous data publishing, and report on the proposed solutions, with a special focus on solutions concerning location or geo-referenced data.
As a matter of fact, a wealth of algorithms has been proposed for privacy-preserving data publishing, either for microdata or statistical data.
In this context, we seek to offer a guide that allows readers to choose the proper algorithm(s) for their specific use case.
We provide insight into the time-related properties of the algorithms, e.g.,~whether they work on finite or infinite data, or whether they take into consideration any underlying data dependence.
% Landmarks
Having discussed the literature around continuous data publishing, we proceed to propose a novel type of data privacy, called \emph{{\thething} privacy}.
We argue that in continuous data publishing, events are not equally significant in terms of privacy, and hence they should affect the privacy-preserving processing differently.
Differential privacy is a well-established paradigm in privacy-preserving time series publishing.
The existing differential privacy schemes protect either a single timestamp, or all the data per user or per window in the time series; however, they consider all timestamps equally significant.
The novel scheme that we propose, {\thething} privacy, is based on differential privacy, but also takes into account significant events (\emph{\thethings}) in the time series and allocates the available privacy budget accordingly.
We design three privacy schemes that guarantee {\thething} privacy and further extend them in order to provide more robust privacy protection to the {\thething} set.
We evaluate our proposal on real and synthetic data sets and assess the impact on data utility, with an emphasis on settings where temporal correlation is present.
% \kat{add selection, and a small comment on the conclusions driven by the experiments.}
The results of the experimental evaluation and comparative analysis of {\thething} privacy validate its applicability to several use case scenarios with and without the presence of temporal correlation.
\paragraph{Keywords:}
data quality, data privacy, continuous data publishing, crowdsensing, privacy-preserving data processing