In this section, we present the experiments that we performed on real and synthetic data sets to test the methodology that we presented in Section~\ref{subsec:lmdk-sol}.
With the experiments on the real data sets (Section~\ref{subsec:lmdk-expt-bgt}), we evaluate the data utility of our three {\thething} privacy budget allocation schemes: Skip, Uniform, and Adaptive.
We define data utility as the mean absolute error (MAE) introduced by the privacy mechanism.
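For concreteness, denoting by $x_t$ the actual value and by $o_t$ the perturbed release at timestamp $t$ (symbols used here only for illustration), the MAE over a time series of length $T$ is
\[
  \mathrm{MAE} = \frac{1}{T} \sum_{t = 1}^{T} \lvert o_t - x_t \rvert .
\]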
We compare against event- and user-level differential privacy, and show that, in the general case, {\thething} privacy allows for better data utility than user-level differential privacy.
With the experiments on the synthetic data sets (Section~\ref{subsec:lmdk-expt-cor}), we turn from utility to privacy: we measure the privacy loss incurred by our framework when tuning the size and statistical characteristics of the input {\thething} set $L$, with special emphasis on how the privacy loss under temporal correlation is affected by the number and distribution of the {\thethings}.
For the Copenhagen data set (Figure~\ref{fig:copenhagen}), Adaptive performs consistently well across all {\thething} percentages and achieves the best utility among the three mechanisms for $0$\%, $60$\%, and $80$\% {\thethings}.
Notably, for $0$\% {\thethings} it achieves even better utility than event-level protection; we attribute this to its approximation step, since reusing a previous release instead of perturbing anew can introduce less error when consecutive values are similar.
The Skip mechanism excels, compared to the others, in cases where it needs to approximate $20$\%--$40$\% or $100$\% of the releases.
The combination of the small range of measurements in HUE ($[0.28, 4.45]$, with an average of $0.88$kWh) and the large scale of the Laplace mechanism results in a low MAE for Skip (Figure~\ref{fig:hue}).
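As an indicative, hypothetical calculation: taking the sensitivity equal to the value range, $\Delta \approx 4.45 - 0.28 = 4.17$kWh, and assuming a per-timestamp budget of, say, $\varepsilon = 0.01$, the Laplace mechanism adds noise with scale (and expected absolute magnitude) $\Delta/\varepsilon \approx 417$kWh, i.e., two orders of magnitude larger than the spread of the actual measurements.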
However, the Adaptive mechanism performs considerably better than Uniform and, for all {\thething} percentages, achieves a utility that lies between that of event- and user-level protection.
In the T-drive data set (Figure~\ref{fig:t-drive}), the Adaptive mechanism outperforms Uniform by $10$\%--$20$\% for all {\thething} percentages greater than $40$\%, and Skip by more than $20$\%.
The lower density of the T-drive data set (an average distance of $623$m between consecutive locations) has a negative impact on the performance of Skip; republishing a previously perturbed value is now less accurate than perturbing the new location.
In general, taking also into account the drawbacks of the Skip mechanism discussed in Section~\ref{subsec:lmdk-mechs}, most notably its reliance on approximating new releases with previously published values, which hurts utility on sparse time series, we consider Adaptive the most reliable and best-performing mechanism: it behaves well across all data sets and {\thething} percentages without requiring any data set-specific configuration.
Moreover, a data-dependent sampling scheme, i.e., one that adapts the sampling rate of Adaptive to the evolution of the data values, could further improve its utility; we leave the design and evaluation of such a scheme as future work.
Figure~\ref{fig:avg-dist} shows a comparison of the average temporal distance of the events from the previous/next {\thething} or the start/end of the time series for various distributions in synthetic data.
More specifically, for every event we count the number of events between it and the nearest {\thething} or series edge, and report the average of these counts.
\caption{Average temporal distance of the events from the {\thethings} for different {\thething} percentages within a time series, for various {\thething} distributions.}
We observe that the uniform and bimodal distributions tend to limit the regular event--{\thething} distance.
This is due to the fact that the former scatters the {\thethings}, while the latter distributes them on both edges, leaving a shorter space uninterrupted by {\thethings}.
On the contrary, distributing the {\thethings} over one part of the sequence, as in the skewed or symmetric distributions, creates a wider space without {\thethings}.
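The following minimal sketch illustrates the distance measure described above; it is written in Python purely for illustration, and the function name and edge-handling convention are indicative rather than part of our experimental code.
\begin{verbatim}
# Sketch: average number of events between each event and its
# nearest landmark or series edge (edges treated as positions 0 and T+1).
def avg_event_landmark_distance(T, landmarks):
    anchors = set(landmarks) | {0, T + 1}
    distances = []
    for t in range(1, T + 1):
        nearest = min(abs(t - a) for a in anchors)
        distances.append(max(nearest - 1, 0))  # events strictly in between
    return sum(distances) / T

# Example: 100 timestamps, landmarks placed uniformly every 10 steps.
print(avg_event_landmark_distance(100, set(range(10, 101, 10))))
\end{verbatim}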
Figure~\ref{fig:dist-cor} illustrates a comparison among the aforementioned distributions regarding the overall privacy loss under (a)~weak, (b)~moderate, and (c)~strong temporal correlation degrees.
\caption{Privacy loss for different {\thething} percentages and distributions under (a)~weak, (b)~moderate, and (c)~strong degrees of temporal correlation.}
In combination with Figure~\ref{fig:avg-dist}, we conclude that a greater average event--{\thething} distance in a distribution can result in a greater overall privacy loss under moderate and strong temporal correlation.
This is due to the fact that the backward/forward privacy loss accumulates more over time in wider spaces without {\thethings} (see Section~\ref{sec:correlation}).
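As a brief reminder, denoting by $\alpha^B_t$ and $\alpha^F_t$ the backward and forward privacy loss at timestamp $t$ (we use these symbols here as shorthand for the quantities defined in Section~\ref{sec:correlation}), the overall privacy loss at $t$ can be written as
\[
  \alpha_t = \alpha^B_t + \alpha^F_t - \varepsilon_t ,
\]
where $\varepsilon_t$ is the budget spent at $t$, subtracted because it is included in both components; the wider the {\thething}-free interval around $t$, the more releases each component accumulates over.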
Furthermore, the privacy loss behaves as expected with respect to the degree of temporal correlation: a stronger correlation generates a higher privacy loss, while widening the gap between the different distribution cases.
On the contrary, a weaker correlation degree makes it harder to differentiate among the {\thething} distributions; under weak correlation, the privacy loss of the different distributions converges.