6 年之前 · 5abf3600e9
--- a/thesis/content/02-background/dapps.tex
+++ b/thesis/content/02-background/dapps.tex
@@ -1,8 +1,6 @@
 
				 The term Web 1.0 refers to the beginnings of the Internet, which consisted of simple static web pages. The central idea was to present or consume content. The characteristic of Web 2.0 is the user's participation in the creation process. Thus, a series of platforms (blogs, social networks) was launched on which users can provide content and connect. Typically, Web 2.0 platforms have a centralized structure, which entails the problems mentioned in the previous chapter. On the occasion of the 30th anniversary of the World Wide Web in March 2019, the initiator Tim Berners-Lee summed up that the Internet was misused - partly due to the system design \cite{berners-lee2019web30}.
			
 
				 
			
 
				-With the next version 3.0 of the web, more transparency, security, and fairness should be created. However, while there is broad agreement on what is meant by terms Web 1.0 and Web 2.0, there is no uniform definition of Web 3.0 that has prevailed to date. There are many ideas, but no final solution yet.
			
 
				-
			
 
				-An interpretation of what Web 3.0 is, is all about decentralization, hence it is also called the \acf{dWeb}. In this context, Web 3.0 is considered an umbrella term for a group of emerging technologies such as blockchain, crypto currencies, and distributed systems which are interconnected to create novel applications, so-called \acf{dApp}. Although decentralized applications have existed for a long time (e.g., BitTorrent), these applications do not meet the criteria of a \ac{dApp}.
			
 
				+With the next version 3.0 of the web, more transparency, security, and fairness should be created. However, while there is broad agreement on what is meant by terms Web 1.0 and Web 2.0, there is no uniform definition of Web 3.0 that has prevailed to date. There are many ideas, but no final solution yet. An interpretation of what Web 3.0 is, is all about decentralization, hence it is also called the \acf{dWeb}. In this context, Web 3.0 is considered an umbrella term for a group of emerging technologies such as blockchain, crypto currencies, and distributed systems which are interconnected to create novel applications, so-called \acf{dApp}. Although decentralized applications have existed for a long time (e.g., BitTorrent), these applications do not meet the criteria of a \ac{dApp}.
			
 
				 
			
 
				 \subsection{Characteristics of a \ac{dApp}}
			
 
				 \label{sec:dapp-characterisitics}
			
--- a/thesis/content/02-background/p2p.tex
+++ b/thesis/content/02-background/p2p.tex
@@ -1,10 +1,6 @@
 
				-The distinctive feature of \ac{P2P} systems is that each participant has the role of both a server and a client. The participants are therefore equal and provide each other with services, what is reflected in the naming. \ac{P2P} networks are usually characterized as overlay networks over the Internet. Concerning the structure of the overlay network, a distinction is made between structured and unstructured networks. The \ac{P2P} principle became well-known in 1999 with the file-sharing application Napster. The software connected its users and allowed accessing (mainly copyrighted) songs among the participants without having to offer them from a central server.
			
 
				+The distinctive feature of \ac{P2P} systems is that each participant has the role of both a server and a client. The participants are therefore equal and provide each other with services, what is reflected in the naming. \ac{P2P} networks are usually characterized as overlay networks over the Internet. Concerning the structure of the overlay network, a distinction is made between structured and unstructured networks. The \ac{P2P} principle became well-known in 1999 with the file-sharing application Napster. The software connected its users and allowed accessing (mainly copyrighted) songs among the participants without having to offer them from a central server. Popular applications of \ac{P2P} networks are file sharing (e.g., BitTorrent), instant messaging (e.g., Skype) and blockchain technology (e.g., Bitcoin).
			
 
				 
			
 
				-Popular applications of \ac{P2P} networks are file sharing (e.g., BitTorrent), instant messaging (e.g., Skype) and blockchain technology (e.g., Bitcoin).
			
 
				-
			
 
				-Their independence particularly characterizes \ac{P2P} networks: there are no control points and not necessarily a fixed infrastructure which leads to minimal operating costs. Besides, \ac{P2P} networks are self-organized and self-scaling, as each additional user contributes its resources.
			
 
				-
			
 
				-However, there are also some challenges in \ac{P2P} networks that need to be solved for successful operation. These include finding peers in the network (peer discovery) and finding resources (resource discovery). Especially in file sharing networks, solutions have to be found how to motivate users to upload data and not only use the download one-sidedly. The replication of data and the associated availability must also be taken into account in solutions. Another critical issue is the Internet connection of individual participants, which may not be powerful or permanent.
			
 
				+Their independence particularly characterizes \ac{P2P} networks: there are no control points and not necessarily a fixed infrastructure which leads to minimal operating costs. Besides, \ac{P2P} networks are self-organized and self-scaling, as each additional user contributes its resources. However, there are also some challenges in \ac{P2P} networks that need to be solved for successful operation. These include finding peers in the network (peer discovery) and finding resources (resource discovery). Especially in file sharing networks, solutions have to be found how to motivate users to upload data and not only use the download one-sidedly. The replication of data and the associated availability must also be taken into account in solutions. Another critical issue is the Internet connection of individual participants, which may not be powerful or permanent.
			
 
				 
			
 
				 \subsection{Unstructured \ac{P2P} Networks}
			
 
				 \label{sec:unstructured-p2p}
			
@@ -27,8 +23,6 @@ In structured networks, compliance with the structure is strictly controlled. By
 
				 
			
 
				 Usually, the routing algorithms are based on a \ac{DHT}. Hash tables are data structures in which key-value pairs are stored, whereby the key must be unique. The corresponding value can then be queried via the key. The keys are ids, which are generated with a hash function (e.g., SHA-1). For the addresses of the nodes and the files, ids are created equally, so that they lie in the same address space. For finding a file, it is searched at the node with the same or the next larger id. If it is not available there, it does not exist on the network.
			
 
				 
			
 
				-For joining a network, either one or more peers must be known as the entry point, or this information must be obtained from a bootstrap server. When entering a structured network, the joining node is assigned a unique id and thus positions itself in the structure. The routing tables of the nodes affected by the structural change must then be updated.
			
 
				-
			
 
				-When leaving a network, this happens either planned, and all affected nodes are informed to update their routing tables, or unexpected. Therefore, nodes must always check the correctness of their routing tables.
			
 
				+For joining a network, either one or more peers must be known as the entry point, or this information must be obtained from a bootstrap server. When entering a structured network, the joining node is assigned a unique id and thus positions itself in the structure. The routing tables of the nodes affected by the structural change must then be updated. When leaving a network, this happens either planned, and all affected nodes are informed to update their routing tables, or unexpected. Therefore, nodes must always check the correctness of their routing tables.
			
 
				 
			
 
				 Known routing algorithms that use \acp{DHT} include Chrod\cite{stoica2003chord}, CAN\cite{ratnasamy2001scalable}, Pastry\cite{rowstron2001pastry}, Tapestry\cite{zhao2004tapestry} and Kademlia\cite{maymounkov2002kademlia}. Among other things, they differ in their distinct structure and the hash functions used.
			
--- a/thesis/content/02-background/software-system-architecture.tex
+++ b/thesis/content/02-background/software-system-architecture.tex
@@ -1,7 +1,4 @@
 
				-The software system architecture describes the relationships and properties of individual software components. It is a model that describes a software on a high-level design. The structure of an architecture can be represented mathematically as a graph, with the nodes representing the individual software components and the edges their relationships to each other. Although the individual components can be executed on the same computer, they are usually interconnected via networks. In general, a distinction is made between centralized, decentralized and distributed architectures as shown in Figure \ref{fig:software-system-architecture}.
			
 
				-
			
 
				-In the following, the characteristics and peculiarities of the different architectures are described in detail.
			
 
				-
			
 
				+The software system architecture describes the relationships and properties of individual software components. It is a model that describes a software on a high-level design. The structure of an architecture can be represented mathematically as a graph, with the nodes representing the individual software components and the edges their relationships to each other. Although the individual components can be executed on the same computer, they are usually interconnected via networks. In general, a distinction is made between centralized, decentralized and distributed architectures as shown in Figure \ref{fig:software-system-architecture}. In the following, the characteristics and peculiarities of the different architectures are described in detail.
			
 
				 
			
 
				 \begin{figure}[h!]
			
 
				 	\centering
			
@@ -73,3 +70,5 @@ Table \ref{tab:comparison-architectures} compares the main features of the diffe
 
				 	\caption{Comparison of different software system architectures on scalability, maintenance, system stability, performance, and data availability. The pluses indicate how positive something is relative to the other systems.}
			
 
				 	\label{tab:comparison-architectures}
			
 
				 \end{table}
			
 
				+
			
 
				+\pagebreak
			
--- a/thesis/content/03-related-work/activitypub.tex
+++ b/thesis/content/03-related-work/activitypub.tex
@@ -12,9 +12,7 @@ Users are called actors in ActivityPub and are represented by an associated acco
 
				 	\label{fig:activitypub-communication}
			
 
				 \end{figure}
			
 
				 
			
 
				-The outbox of an actor holds all his published posts. When accessing the outbox of an actor without authorization, the server delivers all public posts of the actor. If access with authorization occurs, the explicitly shared content is also transmitted.
			
 
				-
			
 
				-The corresponding actors can only access their own inbox. On access, the messages are downloaded from the server. New messages get in the inbox per \ac{HTTP} POST request from another server.
			
 
				+The outbox of an actor holds all his published posts. When accessing the an actor's outbox without authorization, the server delivers all public posts of the actor. If access with authorization occurs, the explicitly shared content is also transmitted. The corresponding actors can only access their own inbox. On access, the messages are downloaded from the server. New messages get in the inbox per \ac{HTTP} POST request from another server.
			
 
				 
			
 
				 Each actor has some so-called collections. Objects (depends on the collection, e.g., accounts, posts) can be removed or added to the collections. The collections are used to store information related to an actor. These are the collections each actor has:
			
 
				 
			
--- a/thesis/content/03-related-work/facecloak.tex
+++ b/thesis/content/03-related-work/facecloak.tex
@@ -15,7 +15,7 @@ After validating several available solutions for personal data protection, the r
 
				 
			
 
				 \begin{figure}[h!]
			
 
				 	\centering
			
 
				-	\includegraphics[width=0.7\textwidth]{facecloak-architecture}
			
 
				+	\includegraphics[width=0.6\textwidth]{facecloak-architecture}
			
 
				 	\caption{Schematic representation of the Setup Phase (1), Encryption Phase (2) and Decryption Phase (3) and the data flow taking place between the entities in FaceCloak's architecture. \cite{luo2009facecloak}}
			
 
				 	\label{fig:facecloak-architecture}
			
 
				 \end{figure}
			
@@ -31,13 +31,9 @@ In addition to adhering to the above design principles, the proposed architectur
 
				 \end{itemize}
			
 
				 
			
 
				 \subsubsection{FaceCloak for Facebook}
			
 
				-To protect the privacy of Facebook users, Luo, Xiu, and Hengartner have created a Firefox browser extension according to the previously described architecture, as well as a server application for storing encrypted real data \cite{facecloakXXXXdownload}.
			
 
				+To protect the privacy of Facebook users, Luo, Xiu, and Hengartner have created a Firefox browser extension according to the previously described architecture, as well as a server application for storing encrypted real data \cite{facecloakXXXXdownload}. The extension uses \ac{AES} and a key length of 128 bits to encrypt the data. The indices for the encrypted data are calculated using SHA-1. The authors propose an e-mail for the key exchange. For this purpose, the browser extension automatically generates e-mail texts and recipient lists and forwards them to the standard e-mail program. The recipients then have to store the received keys in the extension manually.
			
 
				 
			
 
				-The extension uses \ac{AES} and a key length of 128 bits to encrypt the data. The indices for the encrypted data are calculated using SHA-1. The authors propose an e-mail for the key exchange. For this purpose, the browser extension automatically generates e-mail texts and recipient lists and forwards them to the standard e-mail program. The recipients then have to store the received keys in the extension manually.
			
 
				-
			
 
				-In order to protect data with FaceCloak, the prefix @@ must be added to the information in a text field. For other form elements such as dropdowns, radio buttons or checkboxes, the extension creates additional options that also start with @@. When submitting the form, the extension intervenes and replaces the data marked with @@ with fake data. The data to be protected are encrypted with the stored keys and transferred as a key-value pair to the third party server where it is stored. FaceCloak can protect all profile information, but only for name, birthday, and gender algorithms for the meaningful creation of fake data are implemented.
			
 
				-
			
 
				-In addition to profile information, the extension can also protect Facebook Wall and Facebook Notes data. The contents of arbitrary Wikipedia articles are transmitted as fake data to avoid attracting attention with random and unusual character strings.
			
 
				+In order to protect data with FaceCloak, the prefix @@ must be added to the information in a text field. For other form elements such as dropdowns, radio buttons or checkboxes, the extension creates additional options that also start with @@. When submitting the form, the extension intervenes and replaces the data marked with @@ with fake data. The data to be protected are encrypted with the stored keys and transferred as a key-value pair to the third party server where it is stored. FaceCloak can protect all profile information, but only for name, birthday, and gender algorithms for the meaningful creation of fake data are implemented. In addition to profile information, the extension can also protect Facebook Wall and Facebook Notes data. The contents of arbitrary Wikipedia articles are transmitted as fake data to avoid attracting attention with random and unusual character strings.
			
 
				 
			
 
				 When loading a profile page that contains protected data, the extension with asynchronous \ac{HTTP} requests retrieves the information from the third party server, decrypts it, and replaces the fake data. A large part of the replacement can thus be performed during the load process so that the user does not see the fake data. However, since Facebook also loads content asynchronously, some replacements can only be performed with a time delay and the fake data are shortly visible.
			
 
				 
			
--- a/thesis/content/03-related-work/twitterize.tex
+++ b/thesis/content/03-related-work/twitterize.tex
@@ -8,6 +8,10 @@ Daubert et al. stated various demands on the proposed solution. Concerning the p
 
				 	\item \textbf{Anonymity}: A individual user should not be identifiable within a set of users (anonymity set).
			
 
				 \end{itemize}
			
 
				 
			
 
				+\vspace{3em}
			
 
				+
			
 
				+\pagebreak
			
 
				+
			
 
				 Also, concerning the design of the implementation:
			
 
				 
			
 
				 \begin{itemize}
			
--- a/thesis/content/05-proof-of-concept/building-block-view.tex
+++ b/thesis/content/05-proof-of-concept/building-block-view.tex
@@ -11,6 +11,8 @@ Figure \ref{fig:building-block-view} shows a black box view of which other syste
 
				 	\item User 
			
 
				 \end{itemize}
			
 
				 
			
 
				+Infura\footnote{https://infura.io/} is a service that provides access to Ethereum and \ac{IPFS} via a simple interface. Communication with the \ac{API} happens using \ac{HTTP} requests. The connection of \ac{IPFS} in Hybrid \ac{OSN} can thus be carried out in a simple way. The use of an additional system entails an extra risk typically. However, there is a JavaScript client for \ac{IPFS}, which can be integrated into Hybrid \ac{OSN} and thus the dependency on Infura would be omitted. For the creation of the prototype, the decision was made to use Infura for reasons of simplicity. Infura can be used for \ac{IPFS} free of charge and without registration.
			
 
				+
			
 
				 \begin{figure}[h!]
			
 
				 	\centering
			
 
				 	\includegraphics[width=1.0\textwidth]{building-block-view}
			
@@ -18,8 +20,6 @@ Figure \ref{fig:building-block-view} shows a black box view of which other syste
 
				 	\label{fig:building-block-view}
			
 
				 \end{figure}
			
 
				 
			
 
				-Infura\footnote{https://infura.io/} is a service that provides access to Ethereum and \ac{IPFS} via a simple interface. Communication with the \ac{API} happens using \ac{HTTP} requests. The connection of \ac{IPFS} in Hybrid \ac{OSN} can thus be carried out in a simple way. The use of an additional system entails an extra risk typically. However, there is a JavaScript client for \ac{IPFS}, which can be integrated into Hybrid \ac{OSN} and thus the dependency on Infura would be omitted. For the creation of the prototype, the decision was made to use Infura for reasons of simplicity. Infura can be used for \ac{IPFS} free of charge and without registration.
			
 
				-
			
 
				 \subsection{White Box View}
			
 
				 \label{sec:white-box}
			
 
				 The Ionic framework uses Angular in the core. Accordingly, the Hybrid \ac{OSN} app is in principle an Angular application. The essential building blocks are components, pages, and providers (see Figure \ref{fig:building-block-view-level1}). In the following, these components are described in detail and examples are given of where they are used in Hybrid \ac{OSN}.
			
@@ -49,6 +49,8 @@ Data access is performed using providers (known as services in Angular). For the
 
				 	\caption{Providers used in the Hybrid \ac{OSN} app in alphabetical order with their purpose.}
			
 
				 	\label{tab:providers}
			
 
				 \end{table}
			
 
				+\raggedbottom
			
 
				+\pagebreak
			
 
				 
			
 
				 \subsubsection{Components}
			
 
				 \label{sec:components}
			
@@ -87,6 +89,9 @@ Table \ref{tab:pages} lists all pages and their purpose. When the app is opened,
 
				 	\label{tab:pages}
			
 
				 \end{table}
			
 
				 
			
 
				+\raggedbottom
			
 
				+\pagebreak
			
 
				+
			
 
				 \subsubsection{Local Storage}
			
 
				 \label{sec:local-storage}
			
 
				 As the name suggests, this is a local storage that is accessible by the app. With Hybrid \ac{OSN}, this memory is used to store essential information for usage. These include the Twitter user id, the two tokens for accessing the Twitter \ac{API}, the keywords that trigger the private mode, and private and public keys for encryption. Log out completely deletes the local storage.
			
--- a/thesis/content/05-proof-of-concept/osn-selection.tex
+++ b/thesis/content/05-proof-of-concept/osn-selection.tex
@@ -14,9 +14,8 @@ Even the mixed version of displaying and manipulating the mobile website in a We
 
				 
			
 
				 For this number of reasons, Facebook dropped out as an \ac{OSN} candidate for the prototype despite the particular interest. As a further candidate, the \ac{OSN} Google Plus was dropped, as Google announced in October 2018 that it would discontinue its \ac{OSN} \cite{google-plus2018shutdown}.
			
 
				 
			
 
				-Finally, Twitter was chosen for the prototype. With 321 million active users per month (average Q4 2018), it is one of the largest social networks \cite{twitter2019reportq4}. It is particularly well suited for the creation of a hybrid client for two reasons: first, it has a comprehensive \ac{API} that provides almost full functionality free of charge, and second, compared to Facebook, it offers only a few simple functions. These are the ideal prerequisites for the first proof of concept.
			
 
				+Finally, Twitter was chosen for the prototype. With 321 million active users per month (average Q4 2018), it is one of the largest social networks \cite{twitter2019reportq4}. It is particularly well suited for the creation of a hybrid client for two reasons: first, it has a comprehensive \ac{API} that provides almost full functionality free of charge, and second, compared to Facebook, it offers only a few simple functions. These are the ideal prerequisites for the first proof of concept. Twitter offers several \acp{API} for developers that serve different purposes. The current \acp{API} are \cite{twitterXXXXdev-getting-started}:
			
 
				 
			
 
				-Twitter offers several \acp{API} for developers that serve different purposes. The current \acp{API} are \cite{twitterXXXXdev-getting-started}:
			
 
				 \begin{itemize}
			
 
				 	\item \textbf{Standard \ac{API}}: the free and public \ac{API} offering basic query functionality and foundational access to Twitter data.
			
 
				 	\item \textbf{Premium \ac{API}}: introduced in November 2017 to close the gap between Standard and Entrprise \ac{API}. Improvements over the Standard \ac{API}: \enquote{more Tweets per request, higher rate limits, a counts endpoint that returns time-series counts of Tweets, more complex queries and metadata enrichments, such as expanded \acp{URL} and improved profile geo information}\cite{twitter2017premium-api}. Prices to use this \ac{API} start at 149\$/month.
			
--- a/thesis/content/05-proof-of-concept/technology-decisions.tex
+++ b/thesis/content/05-proof-of-concept/technology-decisions.tex
@@ -7,7 +7,9 @@ In order to meet the requirements of the concept best, a detailed consideration
 
				 
			
 
				 \subsection{Creation of a \ac{P2P} Network}
			
 
				 \label{sec:create-p2p-network}
			
 
				-The advantage of having an extra \ac{P2P} network is that it is completely under control. Accordingly, it can be designed to fit exactly to the use case and require little or no compromise. However, setting up a \ac{P2P} network is a big challenge and some hurdles must be overcome. These challenges include peer discovery (how peers find each other), global data exchange over the Internet and data storage, and availability of the stored data. In addition, all these requirements must scale. It should work for \ac{P2P} networks with only a few peers and also for a few thousand or even more peers. In the following, we discuss two options to create the \ac{P2P} network:
			
 
				+The advantage of having an extra \ac{P2P} network is that it is completely under control. Accordingly, it can be designed to fit exactly to the use case and require little or no compromise. However, setting up a \ac{P2P} network is a big challenge and some hurdles must be overcome. These challenges include peer discovery (how peers find each other), global data exchange over the Internet and data storage, and availability of the stored data. In addition, all these requirements must scale. It should work for \ac{P2P} networks with only a few peers and also for a few thousand or even more peers.
			
 
				+
			
 
				+In the following, we discuss two options to create the \ac{P2P} network:
			
 
				 
			
 
				 \begin{itemize}
			
 
				 	\item The use of an established standard such as Wi-Fi Dircet and \ac{WebRTC}