User Tools


Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Next revision
Previous revision
public:nnels:etext:language [2020/06/03 09:08]
rachel.osolen
public:nnels:etext:language [2022/11/24 10:27] (current)
rachel.osolen [How to Work with Languages in Word]
Line 1: Line 1:
-======Text in other languages======+======Text in Other Languages======
  
 Whenever there is text in another language it’s very important to properly identify the language of the text. This ensures that screen readers, braille displays, and other assistive technologies can render the content accurately and read the content according to the pronunciation rules for that language. When no other language has been specified for a phrase or passage of text, its human language is the default human language of the book. Whenever there is text in another language it’s very important to properly identify the language of the text. This ensures that screen readers, braille displays, and other assistive technologies can render the content accurately and read the content according to the pronunciation rules for that language. When no other language has been specified for a phrase or passage of text, its human language is the default human language of the book.
Line 11: Line 11:
   - **Proper names**   - **Proper names**
     - Examples: Bellevue, Pierre     - Examples: Bellevue, Pierre
-  - **Technical terms**+  - **Technical and Scientific terms**
     - Examples: Homo sapiens, Alpha Centauri, hertz, and habeas corpus     - Examples: Homo sapiens, Alpha Centauri, hertz, and habeas corpus
     - Most professions require frequent use of technical terms which may originate from a foreign language. Such terms are usually not translated to all languages. The universal nature of technical terms also facilitate communication between professionals.     - Most professions require frequent use of technical terms which may originate from a foreign language. Such terms are usually not translated to all languages. The universal nature of technical terms also facilitate communication between professionals.
Line 39: Line 39:
   * Written Instructions for Mac: [[https://www.apple.com/voiceover/info/guide/_1124.html]]   * Written Instructions for Mac: [[https://www.apple.com/voiceover/info/guide/_1124.html]]
  
-=====Marking up Languages=====+=====How to Work with Languages in Word===== 
 +The following is a list of the different ways to mark up and approach languages in your workflow.
  
-To mark up secondary language: +  * [[.language:marking_up_languages|Marking Up Languages]]: How to mark languages in MS Word 
-  * Select the text +  * [[.language:when_an_entire_document_is_in_another_language|Full Document in Another Language]]: What to change the language for the full MS Word Doc 
-  * Go to ''Tools > Language'' +  * [[.language:indigenous_languages|Indigenous Languages]]: How to mark up Indigenous Languages. We do this differently that other languages 
-  * This will open a pop up menu +  * [[.language:working_with_languages_not_in_word|Languages Not in MS Word]]: What to do if the language is not in MS Word 
-  * Select the appropriate language+  * [[.language:working_with_images_of_words_and_different_alphabets|Images of Words and Languages in Different Alphabets]]: How to handle images of words and words typed in other alphabets
  
 +<note tip>If something is not clear, look at the Q&A Archive below. If you are still can not find an answer, ask on the [[public:nnels:etext:q_a|Production Q&A]]. There is never a bad question!</note>
 +----
  
  
-=====For entire documents written in another language=====+=====Q&A Archive=====
  
-If the entire book is written in another language, we will need to change the language of the document so that it is not English.  +Q: This question is about how we treat Indigenous languages. I have a book where there are just a few single words in an Indigenous language. They aren't phrases so I haven't marked them as per the new procedure. My question isdo I still include a Prod Note at the beginning identifying the language and explaining how they won't be pronounced correctly?
  
-To change the document language on a Mac, you can follow these steps:  +A: Yes. For Indigenous languages always include the producers note, and do your best to identify the language as precisely as you can.
-[[https://support.office.com/en-us/article/Check-spelling-and-grammar-in-a-different-language-in-Office-2016-for-Mac-0554be72-cd0e-49bd-a112-70ae2f0bf093|Change document language on a Mac]]+
  
-On a PC, Word should automatically detect the language of the document:  +----
-[[https://support.office.com/en-us/article/Check-spelling-and-grammar-in-a-different-language-667ba67a-a202-42fd-8596-edc1fa320e00|Change document language on a PC]]+
  
-=====Indigenous Languages=====+Q: Question regarding The Hanging of Angelique. THe text refers to Fala de Guine, a creole language that is a fusion of Portuguese and African languages. One paragraph of the text utilizes some specific terms from that language to describe Afro-Portuguese culture such as "mangana", "ye ye" and "zarambeque". I'm unsure of the proper language formatting for these terms... should I leave without language formatting or maybe utilize Portuguese?
  
-Currently, we are not able to create Language Style in Word for Indigenous LanguagesIn the future hope we will be able to create Language Styles, but for now we still want to be able to markup these languages so they are set apart from the surrounding text+A: A note about less common languages: If Word does not have language in its options for applying languages then we can not apply itNever apply a similar language as that would be incorrect.
  
-There are two steps for marking Indigenous Languages: +It is good to always check the wiki Language page, and check with me about languages as they can be very tricky sometimes (for example we don't have span tags for most Indigenous languages at this time, but Inuktitut has a set of unicode for its symbolics.)
-  - Apply Emphasis style to the words and phrases the same way you would a Language Style. +
-  - Insert a Producer's Note at the beginning of the text to inform the reader what Indigenous Languages are in the book, and that Text-To-Speech is unable to pronounce these words.+
  
-<note>It is important you try to include the proper names of the Indigenous Languages in the Producer's Note.  Where you can, also include the Tribe name. Sometimes this is clear in the book, and other times you may need to do a bit of research.  If you have any questions please contact the Project Coordinator +If you find there are a lot of these words, or longer phrases let me know and we can put in Producer's Note similar to the one we do for Indigenous languages.
-</note>+
  
-=====A note about poetry===== +Never hesitate to ask! 
-When you are working on poetry, you will **not** be able to apply a particular language style to words and phrases. In this case, you can just leave the Word version without language markup and use just the Poetry (''Poem (DAISY)'') style. Just make a note in the RT ticket that there are multiple languages.+
  
-=====Working with Images of Words and Different Alphabets=====+---- 
 +Q: In Svaha, there are some Mandarin phrases that are spelled out phonetically with the English alphabet. I've been trying to mark them as Strong and to set the language as Chinese but Word won't let me. (I click "okay" after choosing Chinese as the language and the pop up box closes but then the language is still marked as English). Since they're spelled out phonetically, I'm wondering if they even need to be marked? And if they do, do you have any suggestions as to why Word won't let me?
  
-Sometimes a word or phrase will appear as an image in line with the sentence instead of typed text. This is a issue from the publisher. Words or phrases should not be formatted as images, but sometimes publishers do not follow these guidelines. When this happens you will need to transcribe the image of the term of phrase, and then apply the language style. Be sure to delete the images once you are done adding the text version.+An example of the type of phrase I'm talking about is: "Wo hen hsiang chien t'a."
  
-<note>Some languages cannot be transcribed due to the complexity of that language. An example would be Arabic. When it comes to languages like Arabic, unless you are a native speaker you cannot transcribe it correctlyIn this case you would treat the image of the word like other images in the document and add Alt-Text stating it is an Arabic Word. You would then put Producers Note at the beginning of the book to explain why you did this. If you are unsure if the language is something you can safely transcribe please contact you supervisor for more feedback.</note>+Edited to add: Sure, I uploaded it to CyberduckI just want to add that I later came across lot of similarly spelled out Japanese phrases and was also unable to mark them as Japanese.
  
-Sometimes the terms or phrases are typed out in line with the rest of the text, but with a language that uses different alphabet. In this case, if the text appears as typed text, and not an image, then you can simply apply a language style to it as usual. +A: It seems to be an issue with Word since the words are written with english characters. Apply bold style to them, and I will add the language tags manually during conversion. Remember to leave note for me about the languages in a comment when you pass back the ticket in RT
  
 +----
  
-In case you're not sure how to type in different languages, this is how you do it on Mac [[https://support.office.com/en-us/article/Enable-keyboard-layouts-in-different-languages-in-Office-for-Mac-687f804e-4421-4a73-94b3-3febb538a7a1|Enable keyboard layouts in different languages in Office for Mac]] and [[https://support.office.com/en-us/article/Enable-or-change-a-keyboard-layout-language-1c2242c0-fe15-4bc3-99bc-535de6f4f258|Windows]].+Q: I was wondering if you could make me a video showing how to add Language mark-ups? I noticed that there is one section of the My Heart is Not Blind book that says word in Hawaiian, and another that uses the word “nonna”. I looked on the wiki and found these instructions:
  
-In other cases you can use ''unicode'' to enter the characters of the language. For more information on unicode go to the [[public:nnels:etext:symbols|Symbols]] page.+Marking up Languages 
 +To mark up secondary language: 
 + Select the text 
 + Go to Tools > Language 
 + This will open a pop up menu 
 + Select the appropriate language 
 + Apply Strong style to the word or phrase 
 +When passing the ticket to the Production Coordinator, please make note of what languages you used.
  
-=====Applying language styles for DAISY Reformatting===== 
  
-<note important>Do we want to keep this?</note>+So I tried this, however I don’t think I am doing it right. It’s telling me to download the Italian package for Word, for example, when I select the appropriate language. I think I may be in the wrong section...
  
-The language can be set using styles at either the **paragraph** or **character** levels. For entire paragraphs in a foreign language, we use a Paragraph style; for inline words or phrases in another language, we use a character style.+A: It sounds like you need to download the language package to your Word in order to use that language tag. Windows makes you take a few extra steps to add languages you haven't used before. I found this website that breaks down how to apply, addand use editing languages in Windows. Here is the link: [[https://www.customguide.com/word/how-to-change-language-on-word]] 
 +Note: The video at the top automatically stops to give you time to do the taskjust hit play again to have it start up again.
  
-For example, in the image belowwe can create new Character style (let'call the style Turkish) and set the language to Turkish using the Format drop-down menu and selecting Language+Rememberwe do not markup single words that are naturalized into English. Nonna is a word that is also pronounced the same in English, so no markup. The Hawaiian should be marked up, but there is no mark up for Hawaiian in Word .... which is just ...so wrong... Mark the word as strong so I can find it and see if I can add a tag in the code. This will not be read by screenreaders thoughso you can also add producer'note stating that the book has words in Hawaiian that will not be pronounced correctly by assistive technology.
  
-Following these steps will ensure that the text is spoken in the correct language, and converted into XML.+----
  
-<note important>Do not create a language style that is not in the language options for Word. In these cases, create a [[public:nnels:etext:producers-note|Producer's Note]] stating what the language is, and how TTS will not be able to pronounce it properlyFor Indigenous Languagesplease see the section below on Indigenous Languages. When in doubt, ask the Production Coordinator before proceeding.</note>+Q: I am currently working on "Americandreams.ca" which is French language play. I understand that for play you are to apply strong style to characters name and place the lines and stage directions in normal styleOne of the characters speaks English thoughso there are whole sections of the play and a large number of lines in the play that are written in english that may be jarring from the constant back and forthI am wondering if I still apply strong style and language markups to each of these sections and lines?
  
-====Step 1Create a new style (character or paragraph)====+AAlways mark up languages with Strong Style. This is done so I can locate the markup easily and check it. I then remove the strong style for language before conversion. This is noted in the wiki section for languages in the note at the end of the documentation for how to mark up languages. I have revised this note to be more clear. If there are large sections of English then they have to be marked up, as per the language rules. It is with single words and small phrases where we have to be more cautious, as this can be jarring. In a multi-lingual play, it is not surprising to have different languages, and it is more cumbersome to the listener to hear entire phrases mispronounced.
  
-{{:public:nnels:turkish.png?direct400|Create a style}} +----
- +
-====Step 2: Go to ''Language'' in the drop-down menu==== +
- +
-{{:public:nnels:language_menu.png?direct&400|Go to Language in the drop-down}} +
- +
-====Step 3: Set the language of the text==== +
- +
-{{:public:nnels:language_select.png?direct&200|Select the language}} +
- +
- +
- +
-=====Q&A=====+
  
 Q: I'm working on the play "1 Hour Photo." It contains a few Japanese characters but in the conversion, the characters were changed to Roman alphabet letters instead. The English translation is given for the symbols so I'm wondering if I should just erase the Roman alphabet letters. Or would it be better to insert the proper ideogram back in? If so, how do I do that?  Q: I'm working on the play "1 Hour Photo." It contains a few Japanese characters but in the conversion, the characters were changed to Roman alphabet letters instead. The English translation is given for the symbols so I'm wondering if I should just erase the Roman alphabet letters. Or would it be better to insert the proper ideogram back in? If so, how do I do that? 
Line 121: Line 115:
  
 A: You should insert the proper ideogram back in.  You can do this using unicode. Here are [[public:nnels:etext:symbols#using_unicode_for_symbols|the instructions on how to set that up]]--but remember, some [[public:nnels:etext:language#working_with_images_of_words_and_different_alphabets|languages are too complex for this technique]].  If you feel confident you can insert the correct ideogram, the do so.  Remember, we **never** have text as images, even if it is in another alphabet.  A: You should insert the proper ideogram back in.  You can do this using unicode. Here are [[public:nnels:etext:symbols#using_unicode_for_symbols|the instructions on how to set that up]]--but remember, some [[public:nnels:etext:language#working_with_images_of_words_and_different_alphabets|languages are too complex for this technique]].  If you feel confident you can insert the correct ideogram, the do so.  Remember, we **never** have text as images, even if it is in another alphabet. 
 +----
 Q: That's the thing, I don't know how to find the correct Japanese ideogram in Unicode. I don't even know which Japanese alphabet to search in - apparently there are several. I don't feel at all confident that I can identify the correct symbol. I know how to insert symbols with Unicode - the missing part is how to identify the specific code for the correct Japanese symbol. I think it would be one of the CJK Unified Ideographs but I don't know which one and I can't just search "mountain" to find the correct one. The instructions you point to on the wiki don't explain that part. To me, this falls under "Some languages cannot be transcribed due to the complexity of that language" which is why I was wondering if I should find a work-around to still include the symbols for people who do understand Japanese. Or, just leaving the symbols out since the English translation as well as the English pronunciation of the Japanese word are both included. Q: That's the thing, I don't know how to find the correct Japanese ideogram in Unicode. I don't even know which Japanese alphabet to search in - apparently there are several. I don't feel at all confident that I can identify the correct symbol. I know how to insert symbols with Unicode - the missing part is how to identify the specific code for the correct Japanese symbol. I think it would be one of the CJK Unified Ideographs but I don't know which one and I can't just search "mountain" to find the correct one. The instructions you point to on the wiki don't explain that part. To me, this falls under "Some languages cannot be transcribed due to the complexity of that language" which is why I was wondering if I should find a work-around to still include the symbols for people who do understand Japanese. Or, just leaving the symbols out since the English translation as well as the English pronunciation of the Japanese word are both included.
  
Line 130: Line 124:
 Q: I am editing an illustrated children's book that has a sentence where I think I need to indicate a foreign language. It is just a single word but it is clear that a change in language is intended (Page 3 of The Gathering by Theresa Meuse). I tried to follow the instructions for creating a new style but the Mi'kmaw language is not one of the language options. What should I do? Q: I am editing an illustrated children's book that has a sentence where I think I need to indicate a foreign language. It is just a single word but it is clear that a change in language is intended (Page 3 of The Gathering by Theresa Meuse). I tried to follow the instructions for creating a new style but the Mi'kmaw language is not one of the language options. What should I do?
  
-A: Unfortunately, there are currently no language tags for that language.  What you can do is put a Producer's Note in the book with something like "This book includes words and phrases in Mi'kmaw language. Text-to-speech software will not be able to pronounce these words and phrases correctly in the Word version."+A: Unfortunately, there are currently no language tags for that language.  What you can do is put a Producer's Note in the book with something like "This book includes words and phrases in Mi'kmaw language. Text-to-speech software will not be able to pronounce these words and phrases correctly."
 ---- ----
  
Line 142: Line 136:
 A: Looks like this is the Inuktitut language, according to the publication information. Inuktitut can be represented by [[https://en.wikipedia.org/wiki/Unified_Canadian_Aboriginal_Syllabics_(Unicode_block)|Unicode Canadian Aboriginal Syllabics]]. A: Looks like this is the Inuktitut language, according to the publication information. Inuktitut can be represented by [[https://en.wikipedia.org/wiki/Unified_Canadian_Aboriginal_Syllabics_(Unicode_block)|Unicode Canadian Aboriginal Syllabics]].
 We will need to translate the images into Unicode. We will need to translate the images into Unicode.
-If you're using Mac, enable your "Unicode Hex Input" keyboard (see Language section in wiki for instructions). To type each symbol/letter into Word, hold down the ''alt'' key and type the 4-digit number, i.e. ''1400''+If you're using Mac, enable your "Unicode Hex Input" keyboard (see Language section in wiki for instructions). To type each symbol/letter into Word, hold down the ''alt'' key and type the 4-digit number, i.e. ''1400''
 + 
 +Q: alfabet / alphabet features several words in Friesian, which is not included in the list of languages available in Microsoft Word. Should I format these as I would Indigenous words and leave a comment accordingly? 
 + 
 +A: Great question! Yes, you can treat it similarly to Indigenous languages on Turtle Island.
  
 ---- ----
 +Q: I'm working on Pilleurs de rêves, a French language book. It has a couple repeated references to Indigenous languages. One is the name of the people: anishinaabeg. Another is a single word said multiple times: nishin. I'll keep my eye out for more. In the meantime, I'm happy to add the strong style to these words, but I'm wondering if it would be possible to get a French translation of the producer's note for Indigenous languages on the Languages Wiki page.
  
-**Q: am editing a poetry book that uses Italian, French, and Latin.  If I apply a language to one word, it changes the entire line or stanza Should I just leave it as poetry style?**+A: Yes, you need to **always** bold all Indigenous words. will get that translation and let you know when the wiki has been updated.
  
-A: Unfortunatelyidentifying languages in Word doesn't translate well to DAISY XML and requires manual editing of language tags in the XML. You can just leave the Word version without language markup and use just the poetry style. Just make a note in the RT ticket that there are multiple languages.+---- 
 +Q: Another Thousand Times You Lose Your Treasure questionI know that we don't mark up words that have been incorporated into Englishbut I've noticed in this text that the author does not use the anglicized "pho," but instead writes it as "phở." Should I still leave this alone?
  
 +A: You can treat it the same way as the other Vietnamese words as below.
  
 ---- ----
 +Q: I'm working on A Thousand Times You Lose Your Treasure, which works in words in Vietnamese, and I noticed that Vietnamese isn't one of the default language packs. I found the guide in the Wiki for adding languages in Windows -- is there a similar go-to tutorial for Mac users?
  
-**Q: I have book that deals with hebrew words.  Some of the words are typed, and I can create a style for thembut the other words are images of just a letteror an entire word.  How should deal with them?  Should I just put in the alt-text this is an image of this hebrew letter/word? Or should put in a producers note?  I included examples below:** +AIt seems like you can not add Vietnamese to Word on Mac. submitted new question to the community message board, and will let you know if hear anything. For nowjust bold all the words, add a note to RT when you pass it to me that there are Vietnamese words in the documentand will manually put in the span tags when convert it to EPUB3. 
-  +----
-{{:public:nnels:etext:screen_shot_2017-10-23_at_12.24.58_pm.png?nolink&200|}}+
  
-{{:public:nnels:etext:screen_shot_2017-10-23_at_12.27.41_pm.png?nolink&200|}}+QWorking on Late to the House of Words by Sharon Dolin. Because it is a Spanish-English parallel text, is it correct that I apply language on each Spanish part entirely?
  
-{{:public:nnels:etext:screen_shot_2017-10-23_at_12.28.38_pm.png?nolink&200|}}+AYes, you want to apply Spanish to the full phrases. Remember in the future to include an example in your question. 
 +----
  
-AUsing images instead of text is a very bad publishing practice :( Images of text should all be converted to text in the body of the narrativeWe should type out all the text including the Hebrew and Greek text and use a style to tag them as words in the Hebrew or Greek language (as we usually do with foreign language words).+QAnother question about Late to the House of Words by Sharon Dolin. There are some italicised words on both the Spanish and English text. Strong style is already applied to the Spanish piecesIs clearing the format of the particular words and applying emphasis style the correct treatment to the italicised words in Spanish? E.g. Under [No conec cap espai], No conec cap espai que respongui al nom de //casa//
  
-In case you're not sure how to type in different languagesthis is how you do it on a Mac [[https://support.office.com/en-us/article/Enable-keyboard-layouts-in-different-languages-in-Office-for-Mac-687f804e-4421-4a73-94b3-3febb538a7a1|Enable keyboard layouts in different languages in Office for Mac]] and [[https://support.office.com/en-us/article/Enable-or-change-a-keyboard-layout-language-1c2242c0-fe15-4bc3-99bc-535de6f4f258|Windows]].+A: Yeswe follow the same rules for emphasis for other languages.
  
  
 ---- ----
 WCAG 2.0 - H58:[[https://www.w3.org/TR/WCAG20-TECHS/H58.html|Using language attributes to identify changes in the human language]] WCAG 2.0 - H58:[[https://www.w3.org/TR/WCAG20-TECHS/H58.html|Using language attributes to identify changes in the human language]]
 +
 +[[public:nnels:etext:start|Return to main eText Page]]
public/nnels/etext/language.1591200530.txt.gz · Last modified: 2020/06/03 09:08 by rachel.osolen