Sentence Break Chart
Unicode Version: 4.1.0
Date: 2005-03-29, 01:31:34 GMT
| Sep | Format | Sp | Lower | Upper | OLetter | Numeric | ATerm | STerm | Close | Other |
Sep | ÷ | ÷ | ÷ | ÷ | ÷ | ÷ | ÷ | ÷ | ÷ | ÷ | ÷ |
Format | × | × | × | × | × | × | × | × | × | × | × |
Sp | × | × | × | × | × | × | × | × | × | × | × |
Lower | × | × | × | × | × | × | × | × | × | × | × |
Upper | × | × | × | × | × | × | × | × | × | × | × |
OLetter | × | × | × | × | × | × | × | × | × | × | × |
Numeric | × | × | × | × | × | × | × | × | × | × | × |
ATerm | × | × | × | × | × | ÷ | × | ÷ | ÷ | × | ÷ |
STerm | × | × | × | ÷ | ÷ | ÷ | ÷ | ÷ | ÷ | × | ÷ |
Close | × | × | × | × | × | × | × | × | × | × | × |
Other | × | × | × | × | × | × | × | × | × | × | × |
Rules
- 1: sot ÷
- 2: ÷ eot
- 3: Sep ÷
- 4: GC -> FC
- 5: X Format* -> X
- 6: ATerm × ( Numeric | Lower )
- 7: Upper ATerm × Upper
- 8: ATerm Close* Sp* × ( ¬(OLetter | Upper | Lower) )* Lower
- 9: ( Term | ATerm ) Close* × ( Close | Sp | Sep )
- 10: ( Term | ATerm ) Close* Sp × ( Sp | Sep )
- 11: ( Term | ATerm ) Close* Sp* ÷
- 12: Any × Any
Sample Strings
-
( " G o . " ) ( H e d i d . )
-
( “ G o ? ” ) ( H e d i d . )
-
U . S . A ◌̀ . i s
-
U . S . A ◌̀ ? H e
-
U . S . A ◌̀ .
-
3 . 4
-
c . d
-
e t c . ) ’ ‘ ( t h e
-
e t c . ) ’ ‘ ( T h e
-
t h e r e s p . l e a d e r s a r e
-
字 . 字
-
e t c . 它
-
e t c . 。
-
字 。 它
-
□ ( □ " □ G □ o □ . □ " □ ) □ □ ( □ H □ e □ □ d □ i □ d □ . □ ) □ □
-
□ ( □ “ □ G □ o □ ? □ ” □ ) □ □ ( □ H □ e □ □ d □ i □ d □ . □ ) □ □
-
□ U □ . □ S □ . □ A □ ◌̀ . □ □ i □ s □ □
-
□ U □ . □ S □ . □ A □ ◌̀ ? □ □ H □ e □ □
-
□ U □ . □ S □ . □ A □ ◌̀ . □ □
-
□ 3 □ . □ 4 □ □
-
□ c □ . □ d □ □
-
□ e □ t □ c □ . □ ) □ ’ □ □ ‘ □ ( □ t □ h □ e □ □
-
□ e □ t □ c □ . □ ) □ ’ □ □ ‘ □ ( □ T □ h □ e □ □
-
□ t □ h □ e □ □ r □ e □ s □ p □ . □ □ l □ e □ a □ d □ e □ r □ s □ □ a □ r □ e □ □
-
□ 字 □ . □ 字 □ □
-
□ e □ t □ c □ . □ 它 □ □
-
□ e □ t □ c □ . □ 。 □ □
-
□ 字 □ 。 □ 它 □ □