<html dir="ltr">
<head>
<meta http-equiv="Content-Type" content="text/html; charset=Windows-1252">
<style>
<!--
@font-face
{font-family:"Cambria Math"}
@font-face
{font-family:Calibri}
p.MsoNormal, li.MsoNormal, div.MsoNormal
{margin:0in;
margin-bottom:.0001pt;
font-size:11.0pt;
font-family:"Calibri","sans-serif"}
a:link, span.MsoHyperlink
{color:blue;
text-decoration:underline}
a:visited, span.MsoHyperlinkFollowed
{color:purple;
text-decoration:underline}
span.EmailStyle17
{font-family:"Calibri","sans-serif";
color:windowtext}
@page WordSection1
{margin:1.0in 1.0in 1.0in 1.0in}
-->
</style><style id="owaParaStyle" type="text/css">P {margin-top:0;margin-bottom:0;}</style><link href="resource://sharemenot-at-franziroesner-dot-com/sharemenot/data/Content Script/Content.css" type="text/css" rel="stylesheet"><link href="resource://sharemenot-at-franziroesner-dot-com/sharemenot/data/Content Script/Content.css" type="text/css" rel="stylesheet">
</head>
<body ocsi="0" fpstyle="1" lang="EN-US" link="blue" vlink="purple">
<div style="direction: ltr;font-family: Tahoma;color: #000000;font-size: 10pt;">Hi Alex,<br>
<br>
The short answer is that as long as the meaning of "editor" is the same in English and Spanish, it doesn't really matter.<br>
<br>
My gut's answer is that since editors are usually given in 508 notes and those notes are usually given in English, odds are that it's English.<br>
<br>
The long answer asks what are we hoping to do with this information.<br>
<br>
1. As a side effect of the project, we hope to build a multilingual dictionary of words that are actually used in moving image credits that could be a resource for catalogers and others. There are some resources out there (http://home.snafu.de/ohei/ofd/moviedict_e.html),
but I think this would be a useful addition. This is one reason why we ask for literal translations (they'll make more sense in a dictionary format). For training the computer, the translation really only has to reflect the correct category. We'll lump together
all the words that identify a director (directed, direction, directing, director, Regie, kantoku) into a group. For this, we really don't even need to know what language Regie is, just that the computer should use it to mark a name as a director. We do want
to be able to make groupings that distinguish assistant directors or stage directors so we do need a specific translation.<br>
<br>
2. Patterns found here may be useful in helping the computer identify the language of the credit and narrow down or prioritize the list of role words to look for. In the annotator, we provide what we think is the primary language of the movie (whatever is in
008 lang), but that doesn't necessarily say anything about the language of the credit. Credits given in notes are usually, but not always, translated into English. Credits for films in one language may be in another language. For example, some films in Arabic
or African languages in our sample have credits in French.<br>
<br>
It is hard for the computer to identify the language of very short passages, especially sentence fragments. If we know what movie languages commonly go with credits in a certain language, we may be able to make things more efficient for a computer.<br>
<br>
Kelley<br>
<br>
<div style="font-family: Times New Roman; color: #000000; font-size: 16px">
<hr tabindex="-1">
<div style="direction: ltr;" id="divRpF936649"><font color="#000000" face="Tahoma" size="2"><b>From:</b> olac-credits-bounces@lists.uoregon.edu [olac-credits-bounces@lists.uoregon.edu] on behalf of Kyrios, Alex (akyrios@uidaho.edu) [akyrios@uidaho.edu]<br>
<b>Sent:</b> Friday, December 13, 2013 1:51 PM<br>
<b>To:</b> olac-credits@lists.uoregon.edu<br>
<b>Subject:</b> [OLAC-credits] Same word, multiple languages<br>
</font><br>
</div>
<div></div>
<div>
<div class="WordSection1">
<p class="MsoNormal">How should we treat a credit when the language is indeterminate? Specifically, I’m getting “editor” in some Spanish records. If that’s the entirety of the credit, there’s no way to tell whether it’s English or Spanish. I could say “I don’t
know what the language is,” but I know it’s one of the two. I think I was coding these English until I realized that wasn’t necessarily correct.</p>
<p class="MsoNormal"> </p>
<p class="MsoNormal">Alex Kyrios</p>
<p class="MsoNormal">Metadata and Catalog Librarian</p>
<p class="MsoNormal">University of Idaho</p>
<p class="MsoNormal">208-885-2513</p>
<p class="MsoNormal">akyrios@uidaho.edu</p>
<p class="MsoNormal"> </p>
</div>
</div>
</div>
</div>
</body>
</html>