<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Moving Image Review Online: rebuild 2011</title>
	<atom:link href="http://movingimagereview.org/blog/feed/" rel="self" type="application/rss+xml" />
	<link>http://movingimagereview.org/blog</link>
	<description></description>
	<lastBuildDate>Fri, 23 Sep 2011 14:41:07 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.2.1</generator>
		<item>
		<title>article level TEI</title>
		<link>http://movingimagereview.org/blog/2011/09/23/article-level-tei/</link>
		<comments>http://movingimagereview.org/blog/2011/09/23/article-level-tei/#comments</comments>
		<pubDate>Fri, 23 Sep 2011 14:26:17 +0000</pubDate>
		<dc:creator>teeter</dc:creator>
				<category><![CDATA[TEI]]></category>

		<guid isPermaLink="false">http://movingimagereview.org/blog/?p=149</guid>
		<description><![CDATA[The newest release of Best Practices for TEI in Libraries is anticipated in October, but the draft is now available and very useful. Keeping article-level and issue-level metadata from getting mixed up is surprisingly mind-bending. A workable draft TEI header has been completed, and the article-level text has been added for [...]]]></description>
			<content:encoded><![CDATA[<p>The newest release of <cite><a href="http://www.tei-c.org/SIG/Libraries/teiinlibraries/">Best Practices for TEI in Libraries</a></cite> is anticipated in October, but the draft is now available and very useful. Keeping article-level and issue-level metadata from getting mixed up is surprisingly mind-bending. A workable draft TEI header has been completed, and the article-level text has been added for a sample set of two issues. The next step is to import that sample into Greenstone for a trial run of the Greenstone METS profile for article-level entry. The current TEI template is not complete, and the trial run of the Greenstone import will likely be flawed, but that is simply part of the ongoing proof-of-concept work. Documentation of the work on semantic markup and structured metadata has been moved to a <a href="http://digress.it/">Digress.It</a> blog. The URL for that blog will be available once the TEI template is complete.</p>
]]></content:encoded>
			<wfw:commentRss>http://movingimagereview.org/blog/2011/09/23/article-level-tei/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>review: TEI and Greenstone METS</title>
		<link>http://movingimagereview.org/blog/2011/09/17/review-tei-and-greenstone-mets/</link>
		<comments>http://movingimagereview.org/blog/2011/09/17/review-tei-and-greenstone-mets/#comments</comments>
		<pubDate>Sat, 17 Sep 2011 17:15:50 +0000</pubDate>
		<dc:creator>teeter</dc:creator>
				<category><![CDATA[metadata]]></category>
		<category><![CDATA[METS & TEI]]></category>
		<category><![CDATA[project review]]></category>

		<guid isPermaLink="false">http://movingimagereview.org/blog/?p=118</guid>
		<description><![CDATA[As a result of the editorial work Karan completed in 2007 on the single file of OCR text from scans made by Internet Archive, the individual issues of Moving Image Review (MIR) were broken into structured text files with matching &#8220;article&#8221; opening and closing tags. Article metadata was added in a &#8220;type&#8221; attribute [...]]]></description>
			<content:encoded><![CDATA[<p>As a result of the editorial work Karan completed in 2007 on the single file of OCR text from scans made by <a href="http://www.archive.org/">Internet Archive</a>, the individual issues of <cite>Moving Image Review</cite> (<cite>MIR</cite>) were broken into structured text files with matching &#8220;article&#8221; opening and closing tags. Article metadata was added in a &#8220;type&#8221; attribute, and recurring typos and misreadings introduced during the OCR process were corrected (details are available but not yet published).<br />
For example:</p>
<blockquote><p><code>&lt;article type="Preservation"&gt;<br />
Two Decades Of<br />
TV Film To Be Preserved:<br />
Maine's largest and oldest broadcast<br />
collection. </p>
<p>WABI-TV, the Bangor Historical Society<br />
and NHF are cooperating to save and<br />
make accessible to the public an estimated<br />
300 hours (roughly 650,000 feet) of uni-<br />
que 16mm film containing news, sports<br />
and commercials. The film was shot by<br />
Maine's first TV broadcaster, WABI-TV in<br />
Bangor, between 1953 and 1974. </p>
<p>The footage had not been seen since it<br />
was put onto reels after airing on nightly<br />
news broadcasts. It has recently been</p>
<p>(continued on pg. 2)<br />
&lt;/article&gt; </code></p></blockquote>
<p>Karan converted at least one issue into individual article files that were to become .xml files for future testing, but in the 2007 version of <cite>MIR Online</cite> the discrete documents were whole <cite>MIR</cite> issues.</p>
<p>While prefacing each issue with a corpus TEI header was effective for bibliographic control of <cite>MIR</cite> issues on import into Greenstone, discerning the &#8220;type&#8221; of each article so that &#8220;article type&#8221; could become a browse term required a messy workaround. In Greenstone, documents are the building blocks of each Digital Library, and the appropriate granularity for <cite>Moving Image Review</cite> is more clearly <em>article</em> as document rather than <em>issue</em> as document.</p>
<p>A six-issue test subset of <cite>MIR</cite> has been chosen for this iteration of <cite>MIR Online</cite>:</p>
<ul>
<li>Winter: 1988, 1989, and 2003</li>
<li>Summer: 1998, 1999, and 2003</li>
</ul>
<p>Currently, each issue is being broken down into a directory of articles. Articles that span more than one page are being consolidated into a single file. BBEdit, Perl scripts, and regular expressions are being used to remove end-of-line hyphenation and line returns within paragraphs, to prepend a TEI header, and to wrap the article title and text in TEI tags. Once work on these six issues is complete, the source documents will be imported into Greenstone and the resulting documents will be converted into the Greenstone METS profile.</p>
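<p>The text cleanup step can be sketched roughly as follows. This is only an illustration, not the project's actual scripts (those are Perl and BBEdit patterns that have not been published): it uses Python, the function name <code>unwrap_ocr_text</code> is hypothetical, and it assumes that paragraphs in the OCR text are separated by blank lines.</p>

```python
import re


def unwrap_ocr_text(text: str) -> str:
    """Rejoin end-of-line hyphenation and unwrap lines inside paragraphs.

    Hypothetical helper for illustration only. Assumes paragraphs are
    separated by one or more blank lines.
    """
    paragraphs = re.split(r"\n\s*\n", text.strip())
    cleaned = []
    for para in paragraphs:
        # "uni-\nque" becomes "unique": drop a hyphen that ends a line when
        # the next line starts with a lowercase letter. Caveat: a genuine
        # compound split at the hyphen ("well-\nknown") is also joined, so
        # the results still need a manual check.
        para = re.sub(r"(\w)-\n\s*([a-z])", r"\1\2", para)
        # Replace the remaining line returns in the paragraph with spaces.
        para = re.sub(r"\s*\n\s*", " ", para)
        cleaned.append(para.strip())
    # Keep one blank line between paragraphs.
    return "\n\n".join(cleaned)


if __name__ == "__main__":
    sample = (
        "300 hours (roughly 650,000 feet) of uni-\n"
        "que 16mm film containing news, sports\n"
        "and commercials.\n"
        "\n"
        "The footage had not been seen since it\n"
        "was put onto reels."
    )
    print(unwrap_ocr_text(sample))
```

<p>The sample lines are taken from the WABI-TV article quoted earlier in this post. Prepending the TEI header and wrapping the title and text in TEI tags would be separate steps and are not shown here.</p>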
]]></content:encoded>
			<wfw:commentRss>http://movingimagereview.org/blog/2011/09/17/review-tei-and-greenstone-mets/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>metadata structure</title>
		<link>http://movingimagereview.org/blog/2011/09/12/mets_tei/</link>
		<comments>http://movingimagereview.org/blog/2011/09/12/mets_tei/#comments</comments>
		<pubDate>Mon, 12 Sep 2011 17:46:41 +0000</pubDate>
		<dc:creator>teeter</dc:creator>
				<category><![CDATA[METS & TEI]]></category>
		<category><![CDATA[project review]]></category>

		<guid isPermaLink="false">http://movingimagereview.org/blog/?p=71</guid>
		<description><![CDATA[Almost all details of the 2007 diagram of the metadata structure for Moving Image Review Online are still valid. But the resources that were to be published in the Digital Library are more distributed than anticipated in 2007. In particular, moving image files stream from numerous sites, such as the Maine Memory Network and Windows [...]]]></description>
			<content:encoded><![CDATA[<p>Almost all details of the 2007 diagram of the metadata structure for <cite>Moving Image Review Online</cite> are still valid. But the resources that were to be published in the Digital Library are more distributed than anticipated in 2007. In particular, moving image files stream from numerous sites, such as the <a href="http://www.mainememory.net/" title="Maine Memory">Maine Memory</a> Network and <a href="http://windowsonmaine.library.umaine.edu/" title="Windows on Maine">Windows on Maine</a>, in addition to portions of the Collections, such as <a href="http://oldfilm.org/collection/index.php/Browse/CollectionsList" title="clips served by NHF">clips now served by NHF</a>.</p>
<p><img src="http://www.movingimagereview.org/project_documents/MIR_METSdiagramv23Aug07.jpg" alt="MIR Metadata Structure" />[<a href="http://www.movingimagereview.org/project_documents/MIR_METSdiagramv23Aug07.pdf">download as .pdf</a>]</p>
<p>This means that metadata considerations will not be as broad as the diagram above indicates but will focus primarily on the articles of <cite>Moving Image Review</cite>. The first draft of this Digital Library will encode each issue as a <a href="http://www.loc.gov/standards/mets/" title="METS">METS</a> file and each article of an issue as <a href="http://www.tei-c.org/index.xml">TEI</a>. XSLT will transform the TEI/XML files to HTML5.</p>
<p>PDFs and still images may be added to this Digital Library if time permits, at which point they will be encoded in <a href="http://www.loc.gov/standards/mods/">MODS</a> and <a href="http://www.loc.gov/standards/mix/">MIX</a>, respectively. But because the primary task of this project is establishing <cite>Moving Image Review</cite> Online, text will be the only focus until all issues are available at <a href="http://www.movingimagereview.org">movingimagereview.org</a>.</p>
<p>The Greenstone METS profile and the encoding of the draft <a href="http://www.movingimagereview.org/project_documents/mironline_teiheader_draft20080110.txt">January 2008 corpus TEI header</a> are now under review.</p>
]]></content:encoded>
			<wfw:commentRss>http://movingimagereview.org/blog/2011/09/12/mets_tei/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

