<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0" xml:base="http://ideas.4brad.com" xmlns:dc="http://purl.org/dc/elements/1.1/">
<channel>
 <title>Brad Ideas - Data hosting instead of data portability - Comments</title>
 <link>http://ideas.4brad.com/data-hosting-instead-data-portability</link>
 <description>Comments for &quot;Data hosting instead of data portability&quot;</description>
 <language>en</language>
<item>
 <title>Data Spaces in the Clouds</title>
 <link>http://ideas.4brad.com/data-hosting-instead-data-portability#comment-5316</link>
 <description>&lt;p&gt;Brad,&lt;/p&gt;
&lt;p&gt;What you describe is what I&#039;ve referred to as &lt;a href=&quot;http://www.openlinksw.com/dataspace/kidehen@openlinksw.com/weblog/kidehen@openlinksw.com%27s%20BLOG%20%5B127%5D/1261&quot; rel=&quot;nofollow&quot;&gt;Data Spaces in the Clouds (Fourth Platform)&lt;/a&gt; for a while :-)&lt;/p&gt;
&lt;p&gt;Yes, there is some confusion about the literal interpretation of the phrase: Data Portability (free movement of data across platforms).  We don&#039;t necessarily want free movement of data across realms (unless we explicity enable it in our space). Instead, I believe we seek Open Access to our Data Spaces with access control granularity.&lt;/p&gt;
&lt;p&gt;To conclued, we do need Data Access by Reference facilitated by portable Data Containers (Data Spaces) in the Clouds. Of course, these containers can move themselves, or data from the clouds to other locations, wholesale or via replication and synchronixation. In all cases using standard protocols and existing infrastructure such as the Internet and Web.&lt;/p&gt;
&lt;p&gt;Links:&lt;/p&gt;
&lt;p&gt;1. &lt;a href=&quot;http://en.wikipedia.org/wiki/OpenLink_Data_Spaces&quot; rel=&quot;nofollow&quot;&gt;OpenLink Data Space Wikipedia Page&lt;/a&gt;&lt;br /&gt;
2. &lt;a href=&quot;http://virtuoso.openlinksw.com/wiki/main/Main/Ods&quot; rel=&quot;nofollow&quot;&gt;OpenLink Data Spaces (Open Source Edition) Home Page&lt;/a&gt;&lt;br /&gt;
3. &lt;a href=&quot;http://myopenlink.net/dataspace/person/kidehen&quot; rel=&quot;nofollow&quot;&gt; My Data Space Profile Page&lt;/a&gt;&lt;br /&gt;
4. &lt;a href=&quot;http://virtuoso.openlinksw.com/wiki/main/Main/Ods&quot; rel=&quot;nofollow&quot;&gt;EC2 installation Guide&lt;/a&gt;&lt;br /&gt;
5. &lt;a href=&quot;http://virtuoso.openlinksw.com/dataspace/dav/wiki/Main/GetAPersonalURIIn5MinutesOrLes&quot; rel=&quot;nofollow&quot;&gt;How to get a Gateway into your Data Space (i.e a URI for Your Data Space) in 5 minutes or less&lt;/a&gt;&lt;br /&gt;
6. Recent WWW2008 Presentation about Data Portability and Data Accessibility (&lt;a href=&quot;http://community.linkeddata.org/DAV/home/kidehen2/Public/DataPortability_and_DataSpaces.ppt&quot; rel=&quot;nofollow&quot;&gt;PPT&lt;/a&gt;)&lt;/p&gt;
</description>
 <pubDate>Thu, 08 May 2008 13:55:39 -0700</pubDate>
 <dc:creator>Kingsley Idehen</dc:creator>
 <guid isPermaLink="false">comment 5316 at http://ideas.4brad.com</guid>
</item>
<item>
 <title>FoF apps</title>
 <link>http://ideas.4brad.com/data-hosting-instead-data-portability#comment-5313</link>
 <description>&lt;p&gt;As I&amp;#8217;ve noted, FoF apps turn out to be much less interesting than people thought at first.   Do you really look at the photos of your FoFs?   The main FoF app that seems to be useful is LinkedIn&amp;#8217;s &amp;#8220;search your network&amp;#8221; which can answer questions like, &amp;#8220;Who can I contact at Company X&amp;#8221; and dating introductions.    FoFoF turns out to be surprisingly non-useful.&lt;/p&gt;

&lt;p&gt;However, I won&amp;#8217;t proclaim that nobody can think of useful apps (or simply entertaining) apps here.  So there need to be solutions, even if it turns out to be that those apps get access to large networks, but are the only ones that do.   (Remember, our alternative today is zillions of apps getting access to this data.)&lt;/p&gt;

&lt;p&gt;I don&amp;#8217;t expect home PCs to be required here.  Everybody wants an always on host for many functions.   You don&amp;#8217;t &lt;em&gt;need&lt;/em&gt; it in the sense that I don&amp;#8217;t think &amp;#8220;Ask 100 hosts to search for a query&amp;#8221; is a good implementation, but I would use the local hosts just as a way to do things efficiently &lt;em&gt;for the user when the user is signed on&lt;/em&gt;.   The cloud host would do things for others.   Client data hosts and cloud data hosts would sync.&lt;/p&gt;

&lt;p&gt;Your particular app, and other FoF apps, could be implemented, somewhat less efficiently, with data updates.   That is to say, if you are using an FoF
app, you would send changes to your friends, and they would forward those changes on to their friends as part of the update stream.   In this case everybody is storing all the basic data (not big things like photos, just smaller stuff, including the URLs/access tokens of the photos) on their own host, and apps can operate on it.   This is why it does not scale to FoFoF, but I think it could handle FoF if the updates are not large.&lt;/p&gt;

&lt;p&gt;Of course, you, your Fs and your FoFs must be running the same application, but that is the same as saying they are all members of Flickr.    To implement Flickr you need more though, and it may not even be possible to implement all apps in this manner.  Still better than implementing them all in a central repository manner, though.&lt;/p&gt;
</description>
 <pubDate>Wed, 07 May 2008 11:26:57 -0700</pubDate>
 <dc:creator>brad</dc:creator>
 <guid isPermaLink="false">comment 5313 at http://ideas.4brad.com</guid>
</item>
<item>
 <title>Interesting approach</title>
 <link>http://ideas.4brad.com/data-hosting-instead-data-portability#comment-5312</link>
 <description>&lt;p&gt;But I haven&amp;#8217;t studied the underlying systems proposed enough to judge if they can do it.   One concern I see immediately  regards whether developers can be talked into it.   Part of the sex appeal of web 2.0 (meaning apps in the cloud) is that developers get free reign to write and maintain their apps using whatever platforms and tools they like.   They are no longer limited to even the constraints and problems of writing code for a user&amp;#8217;s PC.   Users at the same time love not having to install software, having somebody else maintain it all, and having to roam.&lt;/p&gt;

&lt;p&gt;My own proposals face this problem too.  These abilities are very attractive to users and developers, and as long as they can get the functionality (which javascript has now given) they will rush to them.&lt;/p&gt;

&lt;p&gt;It is for this reason that I have decided that some compromises will be needed, that we won&amp;#8217;t get to the level where we can run a malicious app on our data.  That&amp;#8217;s because the programming hoops required to use a system that bars malicious apps may be too involved.   Happy to be proven wrong, though.   I would be happy just to reach the level where apps don&amp;#8217;t end up taking more data than they need, and don&amp;#8217;t end up storing copies of it.&lt;/p&gt;
</description>
 <pubDate>Wed, 07 May 2008 11:16:17 -0700</pubDate>
 <dc:creator>brad</dc:creator>
 <guid isPermaLink="false">comment 5312 at http://ideas.4brad.com</guid>
</item>
<item>
 <title>W5 Project at MIT</title>
 <link>http://ideas.4brad.com/data-hosting-instead-data-portability#comment-5307</link>
 <description>&lt;p&gt;The W5 project at MIT is looking at ways to solve these issues:&lt;/p&gt;
&lt;p&gt;&lt;a href=&quot;http://pdos.csail.mit.edu/~max/docs/w5.pdf&quot; title=&quot;http://pdos.csail.mit.edu/~max/docs/w5.pdf&quot;&gt;http://pdos.csail.mit.edu/~max/docs/w5.pdf&lt;/a&gt;&lt;/p&gt;
</description>
 <pubDate>Wed, 07 May 2008 04:51:52 -0700</pubDate>
 <dc:creator>Evan Jones</dc:creator>
 <guid isPermaLink="false">comment 5307 at http://ideas.4brad.com</guid>
</item>
<item>
 <title>Searches</title>
 <link>http://ideas.4brad.com/data-hosting-instead-data-portability#comment-5306</link>
 <description>&lt;p&gt;I really wonder how you can efficiently implement this use case:&lt;/p&gt;
&lt;p&gt;&quot;Show me the flickr photos that my friends and the friends of my friends faved in last 2 weeks, sorted by the total number of favs&quot;, while the photos are still stored on flickr.&lt;/p&gt;
&lt;p&gt;I can see you can implemente that with &quot;agent&quot; that will crawl to the hosting sites of your friends and their friends, collect the data and come back. But that would be slow, especially if some of your friends host their data in PCs that are currently offline (remember The Eight Fallacies of Distributed Computing?). Do you have any solution for that in mind?&lt;/p&gt;
&lt;p&gt;BTW, creating a Virtual Machine you still provide API to the applications. But it is quite broad, which means it is difficult to control security.&lt;/p&gt;
</description>
 <pubDate>Wed, 07 May 2008 01:14:33 -0700</pubDate>
 <dc:creator>Radovan Semancik</dc:creator>
 <guid isPermaLink="false">comment 5306 at http://ideas.4brad.com</guid>
</item>
<item>
 <title>Finding the host</title>
 <link>http://ideas.4brad.com/data-hosting-instead-data-portability#comment-5300</link>
 <description>&lt;p&gt;Well, in the past we have used DNS as a way to name hosts and move them around.   This could be used here, but not a high level domain.    I might get bradsdata.datadns.com as a subdomain that I can point at whatever data host I like.  However, there could be other indirection or discovery protocols.&lt;/p&gt;

&lt;p&gt;Typically I see a main site providing your interface to social data apps.   That site would embed other web pages which are served from your data host, running code provided by app providers.    The DNS would direct your own browser at your own data host of choice.   (In fact, this architecture allows the data host to be your own PC, if you don&amp;#8217;t need to roam.   Your own pc at localhost:port would see a request for a social app window.    The data host program on your PC would connect to the specified remote application&amp;#8217;s server for any code updates or special data, download them if it doesn&amp;#8217;t have them cached, execute the code and return the results to the iframe in the browser page.)&lt;/p&gt;

&lt;p&gt;I think I&amp;#8217;ll update about that.&lt;/p&gt;

&lt;p&gt;Now as for the central repository:  This is complex.   People are saying, it seems, that they don&amp;#8217;t want their data scattered around everywhere, both because of lack of control, but more commonly because the UI to give apps access to it is too complex.  If we can develop a good UI so that it is easy to give apps just the data they need, and no more, then scattering can be good.   The data hosting model does not dictate about scattering or centralization, but I agree that users will tend to centralize, just for ease of control.     A central server contracted to me may be better than 30 servers with only loose bonds to me, such as 30 different social app companies each knowing different subsets of my data.&lt;/p&gt;
</description>
 <pubDate>Tue, 06 May 2008 14:48:00 -0700</pubDate>
 <dc:creator>brad</dc:creator>
 <guid isPermaLink="false">comment 5300 at http://ideas.4brad.com</guid>
</item>
<item>
 <title>Service portability good. Centralized servers not necessary.</title>
 <link>http://ideas.4brad.com/data-hosting-instead-data-portability#comment-5299</link>
 <description>&lt;p&gt;Good post.&lt;/p&gt;
&lt;p&gt;It&#039;s clear that the data portability model is limited, but I would even go beyond hosting portability to service portability, and I would separate the hosting company from the value-added service provider.&lt;/p&gt;
&lt;p&gt;First, the hosting model still leaves you at the mercy of the host if/when you decide to move... you still have to update your address at all of the service providers who might access the services of your hosting company. A more robust model would provide service portability through service discovery. Service providers don&#039;t need to know where it the data is hosted, but rather where they can find the current host. That gives you a layer of indirection that lets you move your hosting company without needing to remember which service providers are currently relying on that host.&lt;/p&gt;
&lt;p&gt;Second, you suggest that the hosting company&#039;s job is to perform actions on your data.&lt;/p&gt;
&lt;p&gt;Why is that?&lt;/p&gt;
&lt;p&gt;My DNS host doesn&#039;t perform functions on my DNS. Nor do I expect my webhost to perform actions on my website, although I do like to have a range of services I can easily install and run (such as installing WordPress through Fantastico).&lt;/p&gt;
&lt;p&gt;I would argue that there is an inherent conflict of interest in the hosting company providing value-added services, and that in fact, what we should do is design an architecture where hosting is functionally distinct from value-added services.  &lt;em&gt;Any&lt;/em&gt; authorized value-add service provider should be able to access your data services, which leads to a cleaner architecture where companies that happen to provide &lt;em&gt;both&lt;/em&gt; hosting and value-add services can do so with clear contracts and authorization.&lt;/p&gt;
&lt;p&gt;Finally, there&#039;s no real need to have all of my services at the same hosting company, just like my DNS, my website, and my email can all painlessly be hosted anywhere I like. As long as the services can be discovered, there&#039;s no reason to have any individual&#039;s services centralized, nor a need to centralize many individuals&#039; data in one place.&lt;/p&gt;
&lt;p&gt;The collaborative/co-op type negotiating strategy you suggest can easily be implemented by a value-add service provider, independent of the source of the data.  Users simply join that co-op or buying group and point the co-op to their discovery service. Flash-mobs rejoice.&lt;/p&gt;
&lt;p&gt;All of which is to say, you are definitely going down the right path.  More portability more better.&lt;/p&gt;
&lt;p&gt;-j&lt;/p&gt;
</description>
 <pubDate>Tue, 06 May 2008 12:26:43 -0700</pubDate>
 <dc:creator>Joe Andrieu</dc:creator>
 <guid isPermaLink="false">comment 5299 at http://ideas.4brad.com</guid>
</item>
<item>
 <title>It matters where it is hosted</title>
 <link>http://ideas.4brad.com/data-hosting-instead-data-portability#comment-5298</link>
 <description>&lt;p&gt;Because we want to lay down the duties of the people who host it for us, if we don&amp;#8217;t host it ourselves.  (And most people won&amp;#8217;t host it themselves.)&lt;/p&gt;

&lt;p&gt;If a host has just one duty, to keep the data safe and allow only authorized actions on it, then that is what the host will focus on.   If the host is facebook, whose duty is to find ways to monetize its database to maximize shareholder value, you will get a different result.&lt;/p&gt;

&lt;p&gt;That&amp;#8217;s why the layers, and why for most people, not having too many hosts.   (Some might decide to have multiple hosts to maintain multiple persona, such as a personal and work persona, but that&amp;#8217;s their decision.)&lt;/p&gt;
</description>
 <pubDate>Tue, 06 May 2008 11:42:59 -0700</pubDate>
 <dc:creator>brad</dc:creator>
 <guid isPermaLink="false">comment 5298 at http://ideas.4brad.com</guid>
</item>
<item>
 <title>Up to a point...</title>
 <link>http://ideas.4brad.com/data-hosting-instead-data-portability#comment-5291</link>
 <description>&lt;p&gt;I agree with most of your arguments, but am not sure about the conclusions - in particular I believe we essentially have most of the necessary infrastructure. Rather than thinking in terms of data being maintained in SQL DBs on or off the web, consider the web itself as the database. It really doesn&#039;t matter where the data is hosted as long as appropriate connections are in place. See &lt;a href=&quot;http://en.wikipedia.org/wiki/Linked_Data&quot; title=&quot;http://en.wikipedia.org/wiki/Linked_Data&quot;&gt;http://en.wikipedia.org/wiki/Linked_Data&lt;/a&gt;&lt;/p&gt;
</description>
 <pubDate>Tue, 06 May 2008 04:35:09 -0700</pubDate>
 <dc:creator>Danny Ayers</dc:creator>
 <guid isPermaLink="false">comment 5291 at http://ideas.4brad.com</guid>
</item>
<item>
 <title>Data hosting instead of data portability</title>
 <link>http://ideas.4brad.com/data-hosting-instead-data-portability</link>
 <description>&lt;p&gt;I&amp;#8217;ve been ranting of late about the dangers inherent in &amp;#8220;Data Portability&amp;#8221; which I would like to rename as &lt;a href=&quot;http://ideas.4brad.com/rename-data-portability-bepsi&quot;&gt;BEPSI&lt;/a&gt; to avoid the motherhood word &amp;#8220;portability&amp;#8221; for something that really has a strong dark side as well as its light side.&lt;/p&gt;

&lt;p&gt;But it&amp;#8217;s also important to come up with an alternative.   I think the best alternative may lie in what I would call &amp;#8220;data hosting.&amp;#8221;   It&amp;#8217;s a layered system, with a data layer and an application layer on top.&lt;/p&gt;

&lt;p&gt;A data hosting approach has your personal data stored on a server chosen by you.   (You might have that server right in your own house, or pay for hosting services.)  If you pay, that server&amp;#8217;s duty is not to exploit your data, but rather to protect it.  That&amp;#8217;s what you&amp;#8217;re paying for.   You can have more than one (with different personas, if you like) but for now let&amp;#8217;s imagine having just one.&lt;/p&gt;

&lt;p&gt;Your data host&amp;#8217;s job is to perform actions on your data.  Rather than giving copies of your data out to a thousand companies (the Facebook and Data Portability approach) you host the data and perform actions on it, programmed by those companies who are developing useful social applications.&lt;/p&gt;

&lt;p&gt;As such, you don&amp;#8217;t join a site like Facebook or LinkedIn.   Rather, companies like those build applications which can run on your data.   They don&amp;#8217;t get the data, rather they write code to do things with it that runs in a protected sandbox on your data host.&lt;/p&gt;
</description>
 <comments>http://ideas.4brad.com/data-hosting-instead-data-portability#comments</comments>
 <category domain="http://ideas.4brad.com/archives/cat_best_of_blog.html">Best Of Blog</category>
 <category domain="http://ideas.4brad.com/archives/cat_privacy.html">Privacy</category>
 <pubDate>Mon, 05 May 2008 20:08:19 -0700</pubDate>
 <dc:creator>brad</dc:creator>
 <guid isPermaLink="false">752 at http://ideas.4brad.com</guid>
</item>
</channel>
</rss>
