public:nnels:etext:language [2020/06/03 09:08]
rachel.osolen
public:nnels:etext:language [2022/02/15 13:16]
rachel.osolen [Indigenous Languages]
Line 11:
   - **Proper names**
     - Examples: Bellevue, Pierre
-  - **Technical terms**
+  - **Technical and Scientific terms**
     - Examples: Homo sapiens, Alpha Centauri, hertz, and habeas corpus
     - Most professions require frequent use of technical terms, which may originate from a foreign language. Such terms are usually not translated into every language. The universal nature of technical terms also facilitates communication between professionals.
Line 46:
   * This will open a pop-up menu
   * Select the appropriate language
+  * Apply ''Strong'' style to the word or phrase

+When passing the ticket to the Production Coordinator, please note which languages you used.

+<note>The extra steps of applying ''Strong'' style and listing the languages used in the RT ticket help the Production Coordinator check that the language markup has been applied properly. The Production Coordinator removes the ''Strong'' style during conversion.
+</note>

+If you are working on a Windows computer, you may have to install the editing languages before you can apply them to the text. The following link explains how to do this: [[https://www.customguide.com/word/how-to-change-language-on-word]]
 =====For entire documents written in another language=====
  
Line 54: Line 60:

 To change the document language on a Mac, you can follow these steps:
-[[https://support.office.com/en-us/article/Check-spelling-and-grammar-in-a-different-language-in-Office-2016-for-Mac-0554be72-cd0e-49bd-a112-70ae2f0bf093|Change document language on a Mac]]
+[[https://support.microsoft.com/en-us/topic/change-the-language-office-uses-in-its-menus-and-proofing-tools-f5c54ff9-a6fa-4348-a43c-760e7ef148f8#:~:text=Within%20any%20Office%20application%2C%20select,then%20select%20Set%20as%20Preferred.]]

 On a PC, Word should automatically detect the language of the document:
-[[https://support.office.com/en-us/article/Check-spelling-and-grammar-in-a-different-language-667ba67a-a202-42fd-8596-edc1fa320e00|Change document language on a PC]]
+[[https://support.microsoft.com/en-us/topic/change-the-language-office-uses-in-its-menus-and-proofing-tools-f5c54ff9-a6fa-4348-a43c-760e7ef148f8#ID0EBBF=Windows]]

 =====Indigenous Languages=====
  
-Currently, we are not able to create a Language Style in Word for Indigenous Languages. In the future we hope we will be able to create Language Styles, but for now we still want to be able to mark up these languages so they are set apart from the surrounding text.
+Currently, we are not able to apply language markup to Indigenous Languages in Microsoft Word.

-There are two steps for marking Indigenous Languages:
+The [[https://www.iana.org/assignments/language-subtag-registry/language-subtag-registry|IANA]] language subtag registry includes tags for a few Indigenous Languages. Span tags carrying these language tags can be added later in the conversion process, directly in the XML files for EPUB3 and DAISY text. Unfortunately, screen readers do not recognize these tags at the time of writing. Despite this, we do want to add the tags so that they are already in place when the technology catches up.
-  - Apply Emphasis style to the words and phrases the same way you would a Language Style.
-  - Insert a Producer's Note at the beginning of the text to inform the reader what Indigenous Languages are in the book, and that Text-To-Speech is unable to pronounce these words.

-<note>It is important you try to include the proper names of the Indigenous Languages in the Producer's Note. Where you can, also include the Tribe name. Sometimes this is clear in the book, and other times you may need to do a bit of research.  If you have any questions please contact the Project Coordinator.
-</note>
+<note>You may notice that the IANA registry includes other languages that Word does not currently support. Unfortunately, we do not have the bandwidth at this time to accommodate all the languages that are missing. In accordance with the TRC, we do want to do our best to recognize all Indigenous Languages and work towards more inclusion of these languages in our work.</note>

-=====A note about poetry=====
+This section explains how to set up the Indigenous Languages in Word to help the Production Coordinator add the span tags during conversion.
-When you are working on poetry, you will **not** be able to apply a particular language style to words and phrases. In this case, you can just leave the Word version without language markup and use just the Poetry (''Poem (DAISY)'') style. Just make a note in the RT ticket that there are multiple languages.

-=====Working with Images of Words and Different Alphabets=====
+<note>Not all Indigenous Languages have language tags, so it is very important to identify the language used in the book as specifically as possible in the Producer's Note. This helps the Production Coordinator identify which tag to use.</note>

-Sometimes a word or phrase will appear as an image in line with the sentence instead of typed text. This is an issue from the publisher. Words or phrases should not be formatted as images, but sometimes publishers do not follow these guidelines. When this happens you will need to transcribe the image of the term or phrase, and then apply the language style. Be sure to delete the images once you are done adding the text version.
+There are three steps for marking Indigenous Languages:
+  - Apply ''Strong'' style to the words and phrases.
+  - Insert a Producer's Note at the beginning of the text to inform the reader what Indigenous Languages are in the book, and that Text-To-Speech is unable to pronounce these words.
+  - Leave a comment in the RT ticket indicating what Indigenous Languages are in the book.

-<note>Some languages cannot be transcribed due to the complexity of that language. An example would be Arabic. When it comes to languages like Arabic, unless you are a native speaker you cannot transcribe it correctly. In this case you would treat the image of the word like other images in the document and add Alt-Text stating it is an Arabic Word. You would then put a Producer's Note at the beginning of the book to explain why you did this. If you are unsure if the language is something you can safely transcribe please contact your supervisor for more feedback.</note>
+<note>It is important you try to include the proper names of the Indigenous Languages in the Producer's Note. Where you can, also include the Tribe name. Sometimes this is clear in the book, and other times you may need to do a bit of research. If you have any questions please contact the Project Coordinator.
+</note>
  
-Sometimes the terms or phrases are typed out in line with the rest of the text, but with a language that uses a different alphabet. In this case, if the text appears as typed text, and not an image, then you can simply apply a language style to it as usual.
+<WRAP center round box 80%>

+**Example of Indigenous Language Producer's Note**

-In case you're not sure how to type in different languages, this is how you do it on a Mac [[https://support.office.com/en-us/article/Enable-keyboard-layouts-in-different-languages-in-Office-for-Mac-687f804e-4421-4a73-94b3-3febb538a7a1|Enable keyboard layouts in different languages in Office for Mac]] and [[https://support.office.com/en-us/article/Enable-or-change-a-keyboard-layout-language-1c2242c0-fe15-4bc3-99bc-535de6f4f258|Windows]].
+Producer’s Note (heading 1)

-In other cases you can use ''unicode'' to enter the characters of the language. For more information on unicode go to the [[public:nnels:etext:symbols|Symbols]] page.
+This book uses words and phrases written in [insert language name]. Text-to-speech software will not be able to pronounce the Indigenous-language words correctly in this Word version. (normal style)

-=====Applying language styles for DAISY Reformatting=====
+French Translation:

-<note important>Do we want to keep this?</note>
+Note de rédacteur

-The language can be set using styles at either the **paragraph** or **character** levels. For entire paragraphs in a foreign language, we use a Paragraph style; for inline words or phrases in another language, we use a character style.
+Ce livre comporte des mots et des phrases en [insert language name].
+Les synthèses vocales ne seront pas en mesure de prononcer correctement les mots en langue autochtone dans cette version en format Word.

-For example, in the image below, we can create a new Character style (let's call the style Turkish) and set the language to Turkish using the Format drop-down menu and selecting Language.
+</WRAP>
  
-Following these steps will ensure that the text is spoken in the correct language, and converted into XML.

-<note important>Do not create a language style that is not in the language options for Word. In these cases, create a [[public:nnels:etext:producers-note|Producer's Note]] stating what the language is, and how TTS will not be able to pronounce it properly. For Indigenous Languages, please see the section below on Indigenous Languages. When in doubt, ask the Production Coordinator before proceeding.</note>
+=====Working with Images of Words and Different Alphabets=====

-====Step 1: Create new style (character or paragraph)====
+Sometimes a word or phrase will appear as an image in line with the sentence instead of typed text. This is an issue from the publisher. Words or phrases should not be formatted as images, but sometimes publishers do not follow these guidelines. When this happens you will need to transcribe the image of the term or phrase, and then apply the language style. Be sure to delete the images once you are done adding the text version.

-{{:public:nnels:turkish.png?direct&400|Create style}}
+<note>Some languages cannot be transcribed due to the complexity of that language. An example would be Arabic. When it comes to languages like Arabic, unless you are a native speaker you cannot transcribe it correctly. In this case you would treat the image of the word like other images in the document and add Alt-Text stating it is an Arabic Word. You would then put a Producer's Note at the beginning of the book to explain why you did this. If you are unsure if the language is something you can safely transcribe please contact your supervisor for more feedback.</note>

-====Step 2: Go to ''Language'' in the drop-down menu====
+Sometimes the terms or phrases are typed out in line with the rest of the text, but with a language that uses a different alphabet. In this case, if the text appears as typed text, and not an image, then you can simply apply a language style to it as usual.
-
-{{:public:nnels:language_menu.png?direct&400|Go to Language in the drop-down}}
-
-====Step 3: Set the language of the text====
-
-{{:public:nnels:language_select.png?direct&200|Select the language}}

+In case you're not sure how to type in different languages, this is how you do it on a Mac [[https://support.office.com/en-us/article/Enable-keyboard-layouts-in-different-languages-in-Office-for-Mac-687f804e-4421-4a73-94b3-3febb538a7a1|Enable keyboard layouts in different languages in Office for Mac]] and [[https://support.office.com/en-us/article/Enable-or-change-a-keyboard-layout-language-1c2242c0-fe15-4bc3-99bc-535de6f4f258|Windows]].

+In other cases you can use ''unicode'' to enter the characters of the language. For more information on unicode go to the [[public:nnels:etext:symbols|Symbols]] page.

-=====Q&A=====
+=====Q&A Archive=====
  
 Q: I'm working on the play "1 Hour Photo." It contains a few Japanese characters but in the conversion, the characters were changed to Roman alphabet letters instead. The English translation is given for the symbols so I'm wondering if I should just erase the Roman alphabet letters. Or would it be better to insert the proper ideogram back in? If so, how do I do that?
Line 121: Line 124:

 A: You should insert the proper ideogram back in.  You can do this using unicode. Here are [[public:nnels:etext:symbols#using_unicode_for_symbols|the instructions on how to set that up]]--but remember, some [[public:nnels:etext:language#working_with_images_of_words_and_different_alphabets|languages are too complex for this technique]].  If you feel confident you can insert the correct ideogram, then do so.  Remember, we **never** have text as images, even if it is in another alphabet.
+----
 Q: That's the thing, I don't know how to find the correct Japanese ideogram in Unicode. I don't even know which Japanese alphabet to search in - apparently there are several. I don't feel at all confident that I can identify the correct symbol. I know how to insert symbols with Unicode - the missing part is how to identify the specific code for the correct Japanese symbol. I think it would be one of the CJK Unified Ideographs but I don't know which one and I can't just search "mountain" to find the correct one. The instructions you point to on the wiki don't explain that part. To me, this falls under "Some languages cannot be transcribed due to the complexity of that language" which is why I was wondering if I should find a work-around to still include the symbols for people who do understand Japanese. Or, just leaving the symbols out since the English translation as well as the English pronunciation of the Japanese word are both included.
  
Line 130: Line 133:
 Q: I am editing an illustrated children's book that has a sentence where I think I need to indicate a foreign language. It is just a single word but it is clear that a change in language is intended (Page 3 of The Gathering by Theresa Meuse). I tried to follow the instructions for creating a new style but the Mi'kmaw language is not one of the language options. What should I do?

-A: Unfortunately, there are currently no language tags for that language.  What you can do is put a Producer's Note in the book with something like "This book includes words and phrases in Mi'kmaw language. Text-to-speech software will not be able to pronounce these words and phrases correctly in the Word version."
+A: Unfortunately, there are currently no language tags for that language.  What you can do is put a Producer's Note in the book with something like "This book includes words and phrases in Mi'kmaw language. Text-to-speech software will not be able to pronounce these words and phrases correctly."
 ----
  
Line 143: Line 146:
 We will need to translate the images into Unicode.
 If you're using a Mac, enable your "Unicode Hex Input" keyboard (see Language section in wiki for instructions). To type each symbol/letter into Word, hold down the ''alt'' key and type the 4-digit hex code, e.g. ''1400''.
- 
----- 
- 
-**Q: I am editing a poetry book that uses Italian, French, and Latin.  If I apply a language to one word, it changes the entire line or stanza.  Should I just leave it as poetry style?** 
- 
-A: Unfortunately, identifying languages in Word doesn't translate well to DAISY XML and requires manual editing of language tags in the XML. You can just leave the Word version without language markup and use just the poetry style. Just make a note in the RT ticket that there are multiple languages. 
- 
- 
----- 
- 
-**Q: I have a book that deals with hebrew words.  Some of the words are typed, and I can create a style for them, but the other words are images of just a letter, or an entire word.  How should I deal with them?  Should I just put in the alt-text this is an image of this hebrew letter/word? Or should I put in a producers note?  I included examples below:** 
-  
-{{:public:nnels:etext:screen_shot_2017-10-23_at_12.24.58_pm.png?nolink&200|}} 
- 
-{{:public:nnels:etext:screen_shot_2017-10-23_at_12.27.41_pm.png?nolink&200|}} 
- 
-{{:public:nnels:etext:screen_shot_2017-10-23_at_12.28.38_pm.png?nolink&200|}} 
- 
-A: Using images instead of text is a very bad publishing practice :( Images of text should all be converted to text in the body of the narrative. We should type out all the text including the Hebrew and Greek text and use a style to tag them as words in the Hebrew or Greek language (as we usually do with foreign language words). 
- 
-In case you're not sure how to type in different languages, this is how you do it on a Mac [[https://support.office.com/en-us/article/Enable-keyboard-layouts-in-different-languages-in-Office-for-Mac-687f804e-4421-4a73-94b3-3febb538a7a1|Enable keyboard layouts in different languages in Office for Mac]] and [[https://support.office.com/en-us/article/Enable-or-change-a-keyboard-layout-language-1c2242c0-fe15-4bc3-99bc-535de6f4f258|Windows]]. 
- 
  
 ----
 WCAG 2.0 - H58:[[https://www.w3.org/TR/WCAG20-TECHS/H58.html|Using language attributes to identify changes in the human language]]
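
To make the idea of a language span tag more concrete, here is a minimal Python sketch of the kind of markup the Indigenous Languages section says is added to the XML during conversion. It is an assumption-laden illustration, not the Production Coordinator's actual process: the function name ''wrap_language_span'' is hypothetical, the sample word and the ''cr'' (Cree) tag are chosen only as an example, and writing both ''xml:lang'' and ''lang'' is one common convention for XHTML-based formats.

```python
from html import escape


def wrap_language_span(text: str, lang_tag: str) -> str:
    """Wrap a word or phrase in a span that declares its language.

    Hypothetical helper for illustration only. lang_tag is expected to be
    a tag from the IANA language subtag registry, such as "fr" or "cr".
    """
    safe_text = escape(text, quote=False)  # escape &, < and > in the phrase
    safe_tag = escape(lang_tag, quote=True)  # keep the attribute value safe
    return f'<span xml:lang="{safe_tag}" lang="{safe_tag}">{safe_text}</span>'


# Example: marking a single word as Cree (tag chosen for illustration).
print(wrap_language_span("tânisi", "cr"))
```

Words marked with ''Strong'' style in Word are the ones that would receive a span like this, which is why the list of languages in the RT ticket matters: it tells the Production Coordinator which tag to use.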
public/nnels/etext/language.txt · Last modified: 2022/11/24 10:27 by rachel.osolen