<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Data.gov and lessons from the open-source world</title>
	<atom:link href="http://gov2.net.au/blog/2009/08/26/lessons-from-the-open-source-world/feed/" rel="self" type="application/rss+xml" />
	<link>http://gov2.net.au/blog/2009/08/26/lessons-from-the-open-source-world/</link>
	<description>Design by Ben Crothers of Catch Media</description>
	<lastBuildDate>Wed, 10 Mar 2010 06:21:37 +1100</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.6</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: Mike</title>
		<link>http://gov2.net.au/blog/2009/08/26/lessons-from-the-open-source-world/comment-page-1/#comment-1098</link>
		<dc:creator>Mike</dc:creator>
		<pubDate>Tue, 01 Sep 2009 12:19:39 +0000</pubDate>
		<guid isPermaLink="false">http://gov2.net.au/?p=662#comment-1098</guid>
		<description>&lt;blockquote&gt;What do you think? Should government departments embrace some of the principles of the open-source world in order to liberate public sector information?&lt;/blockquote&gt;

God no, unless we can implement restrictions that allow verification. Otherwise, go for it!</description>
		<content:encoded><![CDATA[<blockquote><p>What do you think? Should government departments embrace some of the principles of the open-source world in order to liberate public sector information?</p></blockquote>
<p>God no, unless we can implement restrictions that allow verification. Otherwise, go for it!</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: matt</title>
		<link>http://gov2.net.au/blog/2009/08/26/lessons-from-the-open-source-world/comment-page-1/#comment-1090</link>
		<dc:creator>matt</dc:creator>
		<pubDate>Tue, 01 Sep 2009 06:56:58 +0000</pubDate>
		<guid isPermaLink="false">http://gov2.net.au/?p=662#comment-1090</guid>
		<description>&lt;blockquote&gt;So a truly resilient and dynamic data.gov.au would hold -
- policies (like the above) with a bias towards ‘always open meta data’ and ‘usually open unless there are good reasons raw data’
- directory of pointers
- showcase of best of breed voted by inside and outside the government.

But it would not hold data or to the foreseeable future even meta data but as standards evolve and open perhaps it could hold meta data too.&lt;/blockquote&gt;

I don&#039;t have a problem with data.gov.au being a storage manager too.
I don’t think it&#039;s reasonable to for every data producer, or publisher to make their data available forever, and to take on those costs.

Suppose a small workgroup if formed to do some studies, and produce a quantity of data, should we expect them to host the data, and care for the infrastructure?   Better to have a process when they can &#039;seal&#039; the data and submit it to an archive, along with whatever metadata will aid discovery.

Perhaps we need to ask, how long will published data need to be available?
Then we can think about how that might be achieved (assuming the metadata and discovery are thrashed out satisfactorily).

Regards

Matt</description>
		<content:encoded><![CDATA[<blockquote><p>So a truly resilient and dynamic data.gov.au would hold -<br />
- policies (like the above) with a bias towards ‘always open meta data’ and ‘usually open unless there are good reasons raw data’<br />
- directory of pointers<br />
- showcase of best of breed voted by inside and outside the government.</p>
<p>But it would not hold data or to the foreseeable future even meta data but as standards evolve and open perhaps it could hold meta data too.</p></blockquote>
<p>I don&#8217;t have a problem with data.gov.au being a storage manager too.<br />
I don’t think it&#8217;s reasonable to for every data producer, or publisher to make their data available forever, and to take on those costs.</p>
<p>Suppose a small workgroup if formed to do some studies, and produce a quantity of data, should we expect them to host the data, and care for the infrastructure?   Better to have a process when they can &#8217;seal&#8217; the data and submit it to an archive, along with whatever metadata will aid discovery.</p>
<p>Perhaps we need to ask, how long will published data need to be available?<br />
Then we can think about how that might be achieved (assuming the metadata and discovery are thrashed out satisfactorily).</p>
<p>Regards</p>
<p>Matt</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Peter J Cooper (@pc0)</title>
		<link>http://gov2.net.au/blog/2009/08/26/lessons-from-the-open-source-world/comment-page-1/#comment-1083</link>
		<dc:creator>Peter J Cooper (@pc0)</dc:creator>
		<pubDate>Tue, 01 Sep 2009 01:47:41 +0000</pubDate>
		<guid isPermaLink="false">http://gov2.net.au/?p=662#comment-1083</guid>
		<description>Alan great topic. 

I agree we need both the cathedral and the bazaar to get some dynamic tension. Generally Stephen, Mike, Andrae, Daniel, Mia, Tony, Hanare, Bec, Kerry, Ben are all on the right track in my mind. 

We also need to take some &lt;strong&gt;principle based&lt;/strong&gt; &#039;10 steps/rules&#039; to &lt;strong&gt;simplify&lt;/strong&gt; and ensure balance and break the current &#039;do nothing&#039; nexus [department heads note expect a mandate memo from your boss soon :-) about releasing meta data and most data] -

1. clearly state what is available ? ... meta data (structure data about data to be released immediately on request) .. OR plain data (release subject to controls see below) ... OR Hybrid 
2. treat &#039;sets&#039; as &#039;endless streams&#039; not as static point-in-time files like open source code (the stream of web site stats from a site does not have a version but the meta does)
3. treat sets as having endless variations on their meta, because we are always seeking further improvement, the joy of the journey so keep a meta version number and when it started/finished
4. state a clear owner for each set/stream and a clear point/s for community support [department heads note]
5. possibly distinguish between sets and streams (sets being point in time, streams being ongoing and more alive)
6. certainly hold and debate and refine all data in a decentralised way
7. index and create a central pointer directory with minimal barriers to listing or access or replication/federation of the directory and ideally the data
8. avoid picking a mandated technology, let the (federated) market decide, an initial short list may simplify and speed startup but no-exclusivity, no caps on new channels or access technologies
9. release everything unless clear legal/privacy/security reasons can be identified within (say) 14 days of a request (bear cost of materials?) and encourage minimal duplication through good access but don&#039;t mandate no-duplication
10. prohibit charging for raw data only for value adds

So a truly resilient and dynamic data.gov.au would hold -
- policies (like the above) with a bias towards &#039;always open meta data&#039; and &#039;usually open unless there are good reasons raw data&#039;
- directory of pointers
- showcase of best of breed voted by inside and outside the government.

But it would not hold data or to the foreseeable future even meta data but as standards evolve and open perhaps it could hold meta data too.

Thoughts?

Cheers, Pete.</description>
		<content:encoded><![CDATA[<p>Alan great topic. </p>
<p>I agree we need both the cathedral and the bazaar to get some dynamic tension. Generally Stephen, Mike, Andrae, Daniel, Mia, Tony, Hanare, Bec, Kerry, Ben are all on the right track in my mind. </p>
<p>We also need to take some <strong>principle based</strong> &#8216;10 steps/rules&#8217; to <strong>simplify</strong> and ensure balance and break the current &#8216;do nothing&#8217; nexus [department heads note expect a mandate memo from your boss soon <img src='http://gov2.net.au/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' />  about releasing meta data and most data] -</p>
<p>1. clearly state what is available ? &#8230; meta data (structure data about data to be released immediately on request) .. OR plain data (release subject to controls see below) &#8230; OR Hybrid<br />
2. treat &#8217;sets&#8217; as &#8216;endless streams&#8217; not as static point-in-time files like open source code (the stream of web site stats from a site does not have a version but the meta does)<br />
3. treat sets as having endless variations on their meta, because we are always seeking further improvement, the joy of the journey so keep a meta version number and when it started/finished<br />
4. state a clear owner for each set/stream and a clear point/s for community support [department heads note]<br />
5. possibly distinguish between sets and streams (sets being point in time, streams being ongoing and more alive)<br />
6. certainly hold and debate and refine all data in a decentralised way<br />
7. index and create a central pointer directory with minimal barriers to listing or access or replication/federation of the directory and ideally the data<br />
8. avoid picking a mandated technology, let the (federated) market decide, an initial short list may simplify and speed startup but no-exclusivity, no caps on new channels or access technologies<br />
9. release everything unless clear legal/privacy/security reasons can be identified within (say) 14 days of a request (bear cost of materials?) and encourage minimal duplication through good access but don&#8217;t mandate no-duplication<br />
10. prohibit charging for raw data only for value adds</p>
<p>So a truly resilient and dynamic data.gov.au would hold -<br />
- policies (like the above) with a bias towards &#8216;always open meta data&#8217; and &#8216;usually open unless there are good reasons raw data&#8217;<br />
- directory of pointers<br />
- showcase of best of breed voted by inside and outside the government.</p>
<p>But it would not hold data or to the foreseeable future even meta data but as standards evolve and open perhaps it could hold meta data too.</p>
<p>Thoughts?</p>
<p>Cheers, Pete.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Andrae Muys</title>
		<link>http://gov2.net.au/blog/2009/08/26/lessons-from-the-open-source-world/comment-page-1/#comment-1082</link>
		<dc:creator>Andrae Muys</dc:creator>
		<pubDate>Mon, 31 Aug 2009 23:57:00 +0000</pubDate>
		<guid isPermaLink="false">http://gov2.net.au/?p=662#comment-1082</guid>
		<description>Again, you need to be careful of your terminology here, and avoid over simplification.

Accessible data is good - accessible data with an atom or rsync feed of updates is better.

Federated search is good - but only if you have a standard query interface[1]
but not if you need to perform joins or correlated queries.

When data is open anyway, where is the security risk?  The actual risk is not security, but currency - hence my mention of atom/rsync above.

So the recommendation will need to be more nuanced.  If we are willing to talk actual technologies[2], closer to:

1) Structured data should be made available in an open format with a well defined mapping into RDF (using standard vocabularies and RDFS/SKOS/OWL)[3]
2) Metadata should be made available in RDF compliant with AGRkMS, DCMI; with any extensions described in RDFS; supporting thesauri in SKOS; and ontology in OWL.
3) In addition, both structured-data and metadata should be exposed as a SPARQL endpoint.
4) Unstructured data should exposed using a full-text search api. Due to its field testing, the google search api is preferred (or even better a mapping of the GS-API into SPARQL).

It&#039;s not enough to just have access to the data - we need to know what it means (hence points 1&amp;2). It also isn&#039;t enough to just slap on a web-form frontend to a text-search box (as is currently done) to count as &quot;access provided&quot;, you need real query facilities, with standard interfaces. Otherwise the transation costs kill you.


[1] SPARQL is a good option for structured queries; Defining a mapping from the google search api to a virtual RDF graph would also allow you to use it for feature-oriented search.
[2] If we want to remain tech-neutral, then we need to expand each of these technologies into a description of the capability they provide, and specify that - but that&#039;s a lot more work, and not something I&#039;m going to attempt in a short post.
[3] D2RQ and similar technologies are available to map SQL databases into RDF with minimal difficulty; although for really large datasets there are going to be unaddressed scalability issues.</description>
		<content:encoded><![CDATA[<p>Again, you need to be careful of your terminology here, and avoid over simplification.</p>
<p>Accessible data is good &#8211; accessible data with an atom or rsync feed of updates is better.</p>
<p>Federated search is good &#8211; but only if you have a standard query interface[1]<br />
but not if you need to perform joins or correlated queries.</p>
<p>When data is open anyway, where is the security risk?  The actual risk is not security, but currency &#8211; hence my mention of atom/rsync above.</p>
<p>So the recommendation will need to be more nuanced.  If we are willing to talk actual technologies[2], closer to:</p>
<p>1) Structured data should be made available in an open format with a well defined mapping into RDF (using standard vocabularies and RDFS/SKOS/OWL)[3]<br />
2) Metadata should be made available in RDF compliant with AGRkMS, DCMI; with any extensions described in RDFS; supporting thesauri in SKOS; and ontology in OWL.<br />
3) In addition, both structured-data and metadata should be exposed as a SPARQL endpoint.<br />
4) Unstructured data should exposed using a full-text search api. Due to its field testing, the google search api is preferred (or even better a mapping of the GS-API into SPARQL).</p>
<p>It&#8217;s not enough to just have access to the data &#8211; we need to know what it means (hence points 1&amp;2). It also isn&#8217;t enough to just slap on a web-form frontend to a text-search box (as is currently done) to count as &#8220;access provided&#8221;, you need real query facilities, with standard interfaces. Otherwise the transation costs kill you.</p>
<p>[1] SPARQL is a good option for structured queries; Defining a mapping from the google search api to a virtual RDF graph would also allow you to use it for feature-oriented search.<br />
[2] If we want to remain tech-neutral, then we need to expand each of these technologies into a description of the capability they provide, and specify that &#8211; but that&#8217;s a lot more work, and not something I&#8217;m going to attempt in a short post.<br />
[3] D2RQ and similar technologies are available to map SQL databases into RDF with minimal difficulty; although for really large datasets there are going to be unaddressed scalability issues.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Kevin Cox</title>
		<link>http://gov2.net.au/blog/2009/08/26/lessons-from-the-open-source-world/comment-page-1/#comment-1081</link>
		<dc:creator>Kevin Cox</dc:creator>
		<pubDate>Mon, 31 Aug 2009 20:28:12 +0000</pubDate>
		<guid isPermaLink="false">http://gov2.net.au/?p=662#comment-1081</guid>
		<description>Mike

any repository of permanent data in data.gov.au defeats the purpose of gov2.0  - the power of the internet is that we link to sources of information - not make copies.  Data should be left where it was created and access provided. This would be a good &quot;principle&quot; for the Task Force to suggest as it would stop the proliferation of data with the attendant security risks.  Keep the source data in its place of origin and do not make copies for access.</description>
		<content:encoded><![CDATA[<p>Mike</p>
<p>any repository of permanent data in data.gov.au defeats the purpose of gov2.0  &#8211; the power of the internet is that we link to sources of information &#8211; not make copies.  Data should be left where it was created and access provided. This would be a good &#8220;principle&#8221; for the Task Force to suggest as it would stop the proliferation of data with the attendant security risks.  Keep the source data in its place of origin and do not make copies for access.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Kevin Cox</title>
		<link>http://gov2.net.au/blog/2009/08/26/lessons-from-the-open-source-world/comment-page-1/#comment-1080</link>
		<dc:creator>Kevin Cox</dc:creator>
		<pubDate>Mon, 31 Aug 2009 20:27:09 +0000</pubDate>
		<guid isPermaLink="false">http://gov2.net.au/?p=662#comment-1080</guid>
		<description>Mike

any repository of permanent data in data.gov.au defeats the purpose of gov2.0  - the power of the internet is that we link to sources of information - not make copies.  Data should be left where it was created and access provided. This would be a good &quot;principle&quot; for the Task Force to suggest as it would stop the proliferation of data with the attendant security risks.  Keep the source data in one place.</description>
		<content:encoded><![CDATA[<p>Mike</p>
<p>any repository of permanent data in data.gov.au defeats the purpose of gov2.0  &#8211; the power of the internet is that we link to sources of information &#8211; not make copies.  Data should be left where it was created and access provided. This would be a good &#8220;principle&#8221; for the Task Force to suggest as it would stop the proliferation of data with the attendant security risks.  Keep the source data in one place.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Dave Bath</title>
		<link>http://gov2.net.au/blog/2009/08/26/lessons-from-the-open-source-world/comment-page-1/#comment-1073</link>
		<dc:creator>Dave Bath</dc:creator>
		<pubDate>Mon, 31 Aug 2009 10:29:08 +0000</pubDate>
		<guid isPermaLink="false">http://gov2.net.au/?p=662#comment-1073</guid>
		<description>Yes to centralization of good AGLS, AGIFT and associated Dublin Core stuff.  All great if documents (PDF and Office Suite documents) actually had the metadata in them (how many standard PDFs of laws and statutes even have the AGLS data item &quot;jurisdiction&quot;?).

This centralization is pretty much like a standard that everybody conforms to so that things are interoperable.  (Think of all the rules coming out of the IETF, W3C, and the Linux Standards Base - central control of standards with distributed production using those standards)

But the choice of keywords and alternate terms is important.  Discrepancies between jurisdictions (each state and federal gov has their own thesaurus) make consolidation difficult.

If the keywords were properly filled in, it would be trivial to ask &quot;give me all the agriculture or water policies or media releases associated with the Grampians&quot; or &quot;give me all the current inquiries across all agencies and jurisdictions that have anything to do with pricing controls&quot;.

The same thing goes for laws and general efficiency.  How long did it take McClelland to go through all laws in Oz to discover all the things to do with secrecy?  (Proper discoverability using keywords and something like Google advanced operators would also make it easy on lawyers to check up on topics across different councils, states, etc - and thus easier on the pockets of their clients).

(Now, if only we could get RFC822 X-headers with AGLS/AGIFT contents put into, and displayed, by common webmail servers and email clients, then the majority of documents in government could be discovered easily by people in the same or different agencies).

It&#039;s also worth noting that the AGLS defines the authoritative source URL for the document (and there is no reason why XML datasets couldn&#039;t include AGLS keywords at different levels of the schema, just like any other document).

Thus, there would be core raw-data producers, readily identifiable whoever the intermediary or value-adder is, using centrally defined thesauri, and preferably a core set of centrally-defined schemata.

So... maybe not &quot;Cathedral&quot;, but a decent sized central chapel that welcomes everybody to help define evolving &quot;dogma&quot;, then the &quot;primary producers&quot; creating the base data, then the &quot;industry&quot; (including cottages) that add value and sell it in the bazaar.</description>
		<content:encoded><![CDATA[<p>Yes to centralization of good AGLS, AGIFT and associated Dublin Core stuff.  All great if documents (PDF and Office Suite documents) actually had the metadata in them (how many standard PDFs of laws and statutes even have the AGLS data item &#8220;jurisdiction&#8221;?).</p>
<p>This centralization is pretty much like a standard that everybody conforms to so that things are interoperable.  (Think of all the rules coming out of the IETF, W3C, and the Linux Standards Base &#8211; central control of standards with distributed production using those standards)</p>
<p>But the choice of keywords and alternate terms is important.  Discrepancies between jurisdictions (each state and federal gov has their own thesaurus) make consolidation difficult.</p>
<p>If the keywords were properly filled in, it would be trivial to ask &#8220;give me all the agriculture or water policies or media releases associated with the Grampians&#8221; or &#8220;give me all the current inquiries across all agencies and jurisdictions that have anything to do with pricing controls&#8221;.</p>
<p>The same thing goes for laws and general efficiency.  How long did it take McClelland to go through all laws in Oz to discover all the things to do with secrecy?  (Proper discoverability using keywords and something like Google advanced operators would also make it easy on lawyers to check up on topics across different councils, states, etc &#8211; and thus easier on the pockets of their clients).</p>
<p>(Now, if only we could get RFC822 X-headers with AGLS/AGIFT contents put into, and displayed, by common webmail servers and email clients, then the majority of documents in government could be discovered easily by people in the same or different agencies).</p>
<p>It&#8217;s also worth noting that the AGLS defines the authoritative source URL for the document (and there is no reason why XML datasets couldn&#8217;t include AGLS keywords at different levels of the schema, just like any other document).</p>
<p>Thus, there would be core raw-data producers, readily identifiable whoever the intermediary or value-adder is, using centrally defined thesauri, and preferably a core set of centrally-defined schemata.</p>
<p>So&#8230; maybe not &#8220;Cathedral&#8221;, but a decent sized central chapel that welcomes everybody to help define evolving &#8220;dogma&#8221;, then the &#8220;primary producers&#8221; creating the base data, then the &#8220;industry&#8221; (including cottages) that add value and sell it in the bazaar.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Mike Nelson</title>
		<link>http://gov2.net.au/blog/2009/08/26/lessons-from-the-open-source-world/comment-page-1/#comment-1069</link>
		<dc:creator>Mike Nelson</dc:creator>
		<pubDate>Mon, 31 Aug 2009 06:54:57 +0000</pubDate>
		<guid isPermaLink="false">http://gov2.net.au/?p=662#comment-1069</guid>
		<description>The only sustainable solution is to mandate the use of open standards. If NSW State Records cannot be convinced to make Keyword AAA open (like Adobe did with PDF) then an alternative open, stable, machine interpretable controlled vocabulary needs to be developed.

At the most basic level, can you even reference &quot;Australia&quot; as a URI in such a way that a machine can know the difference between the Australia in the physical/geographical sense and Australia in the abstract/political/legal sense?

The traditional government method of getting a &quot;product&quot; like an ontology would be to pay a consultant to come up with a specification, put it out a tender, pay a contractor to develop it and get a finished product. The &quot;product&quot; then remains largely static, assuming it even works, and any maintenance requires paying the contractor to make changes.

Suppose instead an ontology was developed like any other open source or open standard project. If it is truly open then it could be developed by volunteer experts without having to go through the mess of tenders and contracts. Open standards projects like W3C generally have much better quality control than anything proprietary and developers will put a surprising amount of their own time into making it work the way it is intended.

My suggestion: pick a non-trivial ontology that needs to be developed and let it run as an open source project.</description>
		<content:encoded><![CDATA[<p>The only sustainable solution is to mandate the use of open standards. If NSW State Records cannot be convinced to make Keyword AAA open (like Adobe did with PDF) then an alternative open, stable, machine interpretable controlled vocabulary needs to be developed.</p>
<p>At the most basic level, can you even reference &#8220;Australia&#8221; as a URI in such a way that a machine can know the difference between the Australia in the physical/geographical sense and Australia in the abstract/political/legal sense?</p>
<p>The traditional government method of getting a &#8220;product&#8221; like an ontology would be to pay a consultant to come up with a specification, put it out a tender, pay a contractor to develop it and get a finished product. The &#8220;product&#8221; then remains largely static, assuming it even works, and any maintenance requires paying the contractor to make changes.</p>
<p>Suppose instead an ontology was developed like any other open source or open standard project. If it is truly open then it could be developed by volunteer experts without having to go through the mess of tenders and contracts. Open standards projects like W3C generally have much better quality control than anything proprietary and developers will put a surprising amount of their own time into making it work the way it is intended.</p>
<p>My suggestion: pick a non-trivial ontology that needs to be developed and let it run as an open source project.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Dale Bowerman</title>
		<link>http://gov2.net.au/blog/2009/08/26/lessons-from-the-open-source-world/comment-page-1/#comment-1067</link>
		<dc:creator>Dale Bowerman</dc:creator>
		<pubDate>Mon, 31 Aug 2009 04:04:52 +0000</pubDate>
		<guid isPermaLink="false">http://gov2.net.au/?p=662#comment-1067</guid>
		<description>I think the following quote sums up an important aspect of what needs to be considered.

Opengov means open &#039;linked data standards&#039; - Tim-Berners Lee [including video from TED]: http://opengov.ideascale.com/akira/dtd/5489-4049</description>
		<content:encoded><![CDATA[<p>I think the following quote sums up an important aspect of what needs to be considered.</p>
<p>Opengov means open &#8216;linked data standards&#8217; &#8211; Tim-Berners Lee [including video from TED]: <a href="http://opengov.ideascale.com/akira/dtd/5489-4049" rel="nofollow">http://opengov.ideascale.com/akira/dtd/5489-4049</a></p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Andrae Muys</title>
		<link>http://gov2.net.au/blog/2009/08/26/lessons-from-the-open-source-world/comment-page-1/#comment-1066</link>
		<dc:creator>Andrae Muys</dc:creator>
		<pubDate>Mon, 31 Aug 2009 03:39:05 +0000</pubDate>
		<guid isPermaLink="false">http://gov2.net.au/?p=662#comment-1066</guid>
		<description>It&#039;s worth keeping in mind that AGLS (and the 2008 version, AGRkMS) is only half the equation. Having a standard set of properties, with associated URIs, and machine readable RDFS to provide definitions is a critical first-step. The second step is ensuring the various controlled vocabularies are open, accessible, locatable, and machine interpretable.

Open - they need to be available to the general public, without legal encumbrance. For example, &quot;Keyword AAA&quot; is a proprietary specification owned and licensed for fee by NSW State Records (http://www.records.nsw.gov.au/recordkeeping/keyword-products/keyword-aaa). Unfortunately it is also included by reference by the NAA and every (AFAIK) state records agency.

Accessible - they need to be available at a stable URL (or PURL) without the need to access them via intermediary links (ie. click-license, or disclaimer) preventing direct access by computers without human intervention.

Locatable - the stable URL needs to be included in the various descriptions of the dependent standards, so programs can automatically pull the definition file.

Machine Interpretable - the definition needs to be available in a standard machine interpretable format. The preferred format would be an RDF serialisation of a SKOS thesaurus. RDFa is an ideal option that can permit the file to be served simultaneously as a human-interpretable XHTML file and a machine-interpretable RDF document.</description>
		<content:encoded><![CDATA[<p>It&#8217;s worth keeping in mind that AGLS (and the 2008 version, AGRkMS) is only half the equation. Having a standard set of properties, with associated URIs, and machine readable RDFS to provide definitions is a critical first-step. The second step is ensuring the various controlled vocabularies are open, accessible, locatable, and machine interpretable.</p>
<p>Open &#8211; they need to be available to the general public, without legal encumbrance. For example, &#8220;Keyword AAA&#8221; is a proprietary specification owned and licensed for fee by NSW State Records (<a href="http://www.records.nsw.gov.au/recordkeeping/keyword-products/keyword-aaa)" rel="nofollow">http://www.records.nsw.gov.au/recordkeeping/keyword-products/keyword-aaa)</a>. Unfortunately it is also included by reference by the NAA and every (AFAIK) state records agency.</p>
<p>Accessible &#8211; they need to be available at a stable URL (or PURL) without the need to access them via intermediary links (ie. click-license, or disclaimer) preventing direct access by computers without human intervention.</p>
<p>Locatable &#8211; the stable URL needs to be included in the various descriptions of the dependent standards, so programs can automatically pull the definition file.</p>
<p>Machine Interpretable &#8211; the definition needs to be available in a standard machine interpretable format. The preferred format would be an RDF serialisation of a SKOS thesaurus. RDFa is an ideal option that can permit the file to be served simultaneously as a human-interpretable XHTML file and a machine-interpretable RDF document.</p>
]]></content:encoded>
	</item>
</channel>
</rss>
