Repository logo

Towards a Privacy Preserving Framework for Publishing Longitudinal Data

dc.contributor.authorSehatkar, Morvarid
dc.contributor.supervisorMatwin, Stanislaw
dc.date.accessioned2014-09-26T17:23:28Z
dc.date.available2014-09-26T17:23:28Z
dc.date.created2014
dc.date.issued2014
dc.degree.disciplineGénie / Engineering
dc.degree.leveldoctorate
dc.degree.namePhD
dc.description.abstractRecent advances in information technology have enabled public organizations and corporations to collect and store huge amounts of individuals' data in data repositories. Such data are powerful sources of information about an individual's life such as interests, activities, and finances. Corporations can employ data mining and knowledge discovery techniques to extract useful knowledge and interesting patterns from large repositories of individuals' data. The extracted knowledge can be exploited to improve strategic decision making, enhance business performance, and improve services. However, person-specific data often contain sensitive information about individuals and publishing such data poses potential privacy risks. To deal with these privacy issues, data must be anonymized so that no sensitive information about individuals can be disclosed from published data while distortion is minimized to ensure usefulness of data in practice. In this thesis, we address privacy concerns in publishing longitudinal data. A data set is longitudinal if it contains information of the same observation or event about individuals collected at several points in time. For instance, the data set of multiple visits of patients of a hospital over a period of time is longitudinal. Due to temporal correlations among the events of each record, potential background knowledge of adversaries about an individual in the context of longitudinal data has specific characteristics. None of the previous anonymization techniques can effectively protect longitudinal data against an adversary with such knowledge. In this thesis we identify the potential privacy threats on longitudinal data and propose a novel framework of anonymization algorithms in a way that protects individuals' privacy against both identity disclosure and attribute disclosure, and preserves data utility. Particularly, we propose two privacy models: (K,C)^P -privacy and (K,C)-privacy, and for each of these models we propose efficient algorithms for anonymizing longitudinal data. An extensive experimental study demonstrates that our proposed framework can effectively and efficiently anonymize longitudinal data.
dc.faculty.departmentInformatique / Computer Science
dc.identifier.urihttp://hdl.handle.net/10393/31629
dc.identifier.urihttp://dx.doi.org/10.20381/ruor-6634
dc.language.isoen
dc.publisherUniversité d'Ottawa / University of Ottawa
dc.subjectLongitudinal data
dc.subjectAnonymization
dc.subjectPrivacy preserving data publishing
dc.subjectData mining
dc.subjectSequence data
dc.titleTowards a Privacy Preserving Framework for Publishing Longitudinal Data
dc.typeThesis
thesis.degree.disciplineGénie / Engineering
thesis.degree.levelDoctoral
thesis.degree.namePhD
uottawa.departmentInformatique / Computer Science

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail ImageThumbnail Image
Name:
Sehatkar_Morvarid_2014_thesis.pdf
Size:
2.29 MB
Format:
Adobe Portable Document Format
Description:

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail ImageThumbnail Image
Name:
license.txt
Size:
4.07 KB
Format:
Item-specific license agreed upon to submission
Description: