{"@context":{"rdf":"http://www.w3.org/1999/02/22-rdf-syntax-ns#","rdfs":"http://www.w3.org/2000/01/rdf-schema#","owl":"http://www.w3.org/2002/07/owl#","foaf":"http://xmlns.com/foaf/0.1/","dc":"http://purl.org/dc/elements/1.1/","dct":"http://purl.org/dc/terms/","sioc":"http://rdfs.org/sioc/types#","blog":"http://vocab.amy.so/blog#","as":"https://www.w3.org/ns/activitystreams#","mf2":"http://microformats.org/profile/","ldp":"http://www.w3.org/ns/ldp#","solid":"http://www.w3.org/ns/solid#","view":"https://terms.rhiaro.co.uk/view#","asext":"https://terms.rhiaro.co.uk/as#","dbp":"http://dbpedia.org/property/","geo":"http://www.w3.org/2003/01/geo/wgs84_pos#","doap":"http://usefulinc.com/ns/doap#","time":"http://www.w3.org/2006/time#"},"@id":"https://rhiaro.co.uk/2013/10/remote-sparql","@type":"as:Article","as:actor":{"@id":"https://rhiaro.co.uk/about#me"},"as:content":"<p>Didn't have much success talking to the Dydra SPARQL endpoint yesterday.  I was briefly worried as there are no docs describing how to write back to the SPARQL endpoint, so I thought that was write-off at once, but then I found a <a href=\"http://blog.dydra.com/2011/09/07/sparql-11\">blog post</a> from 2011 about how that has been introduced.  Just not documented yet apparently.</p>\r\n<p>But to start with, I imported some test triples using the Web interface, into dydra.com/rhiaro/about-me and tried to read them back.</p>\r\n<p>With ARC2, along the lines of:</p>\r\n<pre><code>include_once(\"ARC2/ARC2.php\");\r\n\r\n$config = array(\r\n    'remote_store_endpoint' => 'http://dydra.com/rhiaro/about-me/sparql'\r\n);\r\n\r\n$store = ARC2::getRemoteStore($config);\r\n$query = 'select * where {?s ?p ?o} limit 20';\r\n$rows = $store->query($query, 'rows');</code></pre>\r\n<p>But all I got back was an empty array.  I tried with with the DBPedia endpoint, which fell over a couple of times, but I got results... except... they were different from the results I got when I queried the endpoint directly through their interface.  They seemed sort of metadata-y, rather than actual triples from the store.  But it's hard to tell.</p>\r\n<p>So I had a go with Python's RDFLib to try to figure out who had the problem.</p>\r\n<pre><code>import rdflib\r\n\r\nrdflib.plugin.register('sparql', rdflib.query.Processor, 'rdfextras.sparql.processor', 'Processor')\r\nrdflib.plugin.register('sparql', rdflib.query.Result, 'rdfextras.sparql.query', 'SPARQLQueryResult')\r\n\r\ng = rdflib.Graph()\r\n\r\nquery = \"\"\"\r\n        SELECT *\r\n        FROM <http://dydra.com/rhiaro/about-me/sparql>\r\n        WHERE {\r\n             ?s ?p ?o .\r\n        }Limit 10\r\n    \"\"\"\r\n\r\nfor row in g.query(query):\r\n    print row</code></pre>\r\n<p>And with that I got some triples... but not from the triplestore.  It parsed, I presume, whatever semantic markup it could find in the page itself, the page you see when you visit dydra.com/rhiaro/about-me/sparql.  Eg.</p>\r\n<pre><code>(rdflib.term.URIRef(u'https://s3.amazonaws.com/public.dydra.com/stylesheets/style.css?1337867890'), \r\nrdflib.term.URIRef(u'http://www.w3.org/1999/xhtml/vocab#stylesheet'), \r\nrdflib.term.URIRef(u'http://dydra.com/rhiaro/about-me/sparql'))</code></pre>\r\n<p>Do I have to send an accept header?  Surely RDFLib is supposed to take care of that for me... Whatever.</p>\r\n<p>If that's how you're going to play it, I'll just make the request with CURL directly.  (I used <a href=\"http://docs.python-requests.org/\">Python's Requests</a> because the Web says it's nicer than urllib2):</p>\r\n<pre><code>import requests\r\nimport rdflib\r\n\r\nq = \"select * where {?s ?p ?o}\"\r\nurl = \"http://dydra.com/rhiaro/about-me/sparql\"\r\n\r\np = {'query': q}\r\nh = {'Accept': 'application/json'}\r\nr = requests.get(url, params=p, headers=h)\r\n\r\nprint r.text</code></pre>\r\n<p>Boom!  Triples!  Better yet... the ones in the triplestore!  By default (with no <code>Accept</code> header set) they come through as RDF/XML, and it won't give me Turtle, so JSON seems to be the nicest looking option.  That doesn't really matter though, as nobody really needs to look at it.</p>\r\n<p>I guess I'll try CURL with PHP for Slog'd, and just parse it with ARC2.  It seems a shame that ARC2's remote endpoint querying didn't Just Work with Dydra, but I don't have the time or energy to try to figure out why right now.</p>\r\n<p>Then I need to figure out if I can write to it or not.  If I can't... In the name of progressing, I'll have to ditch it and use ARC2's built in MySQL-based triplestore.</p>\r\n<h2>Update: Parsing the results with RDFLib</h2>\r\n<p>Because I want to understand exactly what Dyrda is giving back to me, I wanted to quickly parse the results and use them like I should be able to use a graph.</p>\r\n<p>The XML that Dydra is returning is not straightforward RDF/XML that RDFLib can just understand. It's a '<a href=\"http://www.w3.org/TR/rdf-sparql-XMLres/'\">SPARQL Result</a>. It looks like this:</p>\r\n<pre><code><sparql xmlns='http://www.w3.org/2005/sparql-results#'>\r\n <head> \r\n    <variable name='s'/> \r\n    <variable name='p'/> \r\n    <variable name='o'/> \r\n </head>\r\n <results>  \r\n    <result> \r\n        <binding name='s'>\r\n            <uri>https://rhiaro.co.uk/about#me</uri>\r\n        </binding> \r\n        <binding name='p'>\r\n            <uri>http://xmlns.com/foaf/0.1/homepage</uri>\r\n        </binding> \r\n        <binding name='o'>\r\n            <uri>https://rhiaro.co.uk</uri>\r\n        </binding> \r\n    </result>\r\n\r\n    ...etc</code></pre>\r\n<p>So later I either have to work out how to make RDFLib understand this, or make RDFLib understand the <a href=\"http://www.w3.org/TR/sparql11-results-json/\">JSON alternative</a>.  I really don't want to have to write a custom parser to deal with it.</p>\r\n<h2>Update: Solved</h2>\r\n<p>Turns out it's as simple as using <code>CONSTRUCT</code> instead of <code>SELECT</code> in the query.  Rookie mistake?  I don't know.  I feel like RDFLib ought to be able to handle the SPARQL results format somehow though.</p>","as:name":"Remote SPARQL endpoints and RDF parsing","as:published":{"@type":"http://www.w3.org/2001/XMLSchema#datetime","@value":"2013-10-21T13:02:00+0000"},"as:tag":[{"@id":"https://rhiaro.co.uk/tags/doing"},{"@id":"blog:Doing"},{"@id":"https://rhiaro.co.uk/tags/dydra"},{"@id":"https://rhiaro.co.uk/tags/hacking"},{"@id":"https://rhiaro.co.uk/tags/learning"},{"@id":"https://rhiaro.co.uk/tags/librdf"},{"@id":"https://rhiaro.co.uk/tags/linked+data"},{"@id":"https://rhiaro.co.uk/tags/phd"},{"@id":"https://rhiaro.co.uk/tags/python"},{"@id":"https://rhiaro.co.uk/tags/rdf"},{"@id":"https://rhiaro.co.uk/tags/rdflib"},{"@id":"https://rhiaro.co.uk/tags/redland"},{"@id":"https://rhiaro.co.uk/tags/semantic+web"},{"@id":"https://rhiaro.co.uk/tags/slogd"},{"@id":"https://rhiaro.co.uk/tags/sparql"}],"as:updated":{"@type":"http://www.w3.org/2001/XMLSchema#datetime","@value":"2013-10-21T18:24:00+0000"}}