The Edit segmentation rule set dialog allows you to make changes to the selected set of segmentation rules. Note that you cannot edit the default set of segmentation rules: first clone it, then edit the cloned copy. See also: How to create a new segmentation rule.

How to begin

To open the Edit segmentation rule set dialog, select the set you wish to edit in the Segmentation rules category of the Settings pane in Project home, or in the Segmentation rules pane of the Resource console, and then click the Edit command.

The Edit segmentation rule set dialog consists of two tabs: Segmentation and Custom lists.

Segmentation tab

This tab allows you to see and edit the rules used for segmentation of imported translation documents. Adjust the default segmentation rules according to your needs.

Simple view

Simple view options:

Do not segment after numbers followed by a period: Check this check box to prevent memoQ from segmenting after any number that is followed by a period, without exceptions. If you leave it unchecked, memoQ segments after numbers, except when the number is followed by an item that you added to the Abbreviation after number list.
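
The effect of this option can be illustrated with a short Python sketch. This is not memoQ's implementation: the function name split_after_periods and the skip_numbers flag are hypothetical, and the sketch ignores the Abbreviation after number list for simplicity.

```python
import re

def split_after_periods(text, skip_numbers):
    """Split after each period followed by whitespace.

    When skip_numbers is True, never split after a number followed
    by a period (a stand-in for the check box being checked).
    """
    segments, start = [], 0
    for match in re.finditer(r"\.\s+", text):
        # Find the word that stands immediately before the period.
        words = text[start:match.start()].split()
        last_word = words[-1] if words else ""
        if skip_numbers and last_word.isdigit():
            continue  # number followed by a period: keep the segment open
        segments.append(text[start:match.start() + 1])
        start = match.end()
    if start < len(text):
        segments.append(text[start:])
    return segments

sample = "See section 3. It lists the details. Done."
unchecked = split_after_periods(sample, skip_numbers=False)
checked = split_after_periods(sample, skip_numbers=True)
print(unchecked)
print(checked)
```

With the flag off, the sketch breaks after "3."; with the flag on, "See section 3. It lists the details." stays in one segment.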

Abbreviations are case sensitive: Check this check box to make memoQ take letter case into account when it matches abbreviations during segmentation. This check box is checked by default.

Order lists alphabetically: Check this check box to sort all abbreviation lists in alphabetical order. If you do not check this check box, all items are shown in the order you entered them in the segmentation rule set. The list is always saved in the original order, even if you sort it.

Click the Advanced view link to show the advanced options:

Advanced view

The Segmentation tab has two sections, Rules on the left and Exceptions on the right. Each of these sections contains a list view showing the current rules and a text field that is used to enter a new rule or modify an existing one.

Note: Before you edit the rules or exceptions, make sure you are familiar with the concept of segmentation and the logic of regular expressions.

Rules section:

Add: Click this button to add a segmentation rule after entering the rule in the text field below the list view.

Change: Click this button to modify a rule after selecting it in the list view and editing it in the text field below.

Delete: Click this button to delete a rule after selecting it in the list view.
 

Exceptions section:

Add: Click this button to add a segmentation exception rule after entering the rule in the text field below the list view.

Change: Click this button to modify an exception rule after selecting it in the list view and editing it in the text field below.

Delete: Click this button to delete an exception rule after selecting it in the list view.

Preview: Click this button to open the Segmentation preview dialog, which shows how sample text is segmented if the current segmentation rules are applied.

Custom lists tab

This tab allows you to see and edit custom lists. Custom lists are a customizable extension of regular expressions, and they are used in the segmentation rules defined on the Segmentation tab.

For example, you would set up a custom list to tell memoQ about abbreviations that end in a period but do not mark the end of a sentence, so that memoQ does not cut the segment after them.
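
One way to picture a custom list is as a named placeholder that stands for "any one of these items" inside a regular expression. The Python sketch below illustrates that idea only. The expand function, the sample list contents, and the sample rule are assumptions made for illustration; memoQ's actual rule syntax and internal handling may differ.

```python
import re

# Hypothetical custom lists; the #name# notation mirrors the list names
# shown on this tab, but the contents here are invented samples.
custom_lists = {
    "abbr": ["Prof.", "Dr.", "acc."],
    "end": [".", "!", "?"],
}

def expand(pattern, lists):
    """Replace each #name# placeholder with a regex alternation of its items."""
    def alternation(match):
        items = lists[match.group(1)]
        # Escape each item so that "." matches a literal period,
        # and put longer items first so they win over shorter ones.
        escaped = sorted((re.escape(item) for item in items), key=len, reverse=True)
        return "(?:" + "|".join(escaped) + ")"
    return re.sub(r"#(\w+)#", alternation, pattern)

# A sample exception pattern: a listed abbreviation followed by whitespace.
rule = expand(r"\b#abbr#\s", custom_lists)
print(rule)
print(bool(re.search(rule, "Prof. Dr. Max Mustermann")))
```

Adding an item to the list changes what the expanded pattern matches without touching the rule itself, which is the convenience custom lists provide.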

Caution: memoQ allows users to add abbreviations directly from the translation grid. This does not work if you delete any of the default custom lists from the segmentation rule set. In general, do not delete default custom lists here; you are still free to add new ones.

The tab has two sections, Custom lists on the left and List items on the right. Each of these sections contains a list view and a text field. In the Custom lists section, the list view shows the existing custom lists, while the text field below is used to enter a new custom list or modify an existing one. In the List items section, the list view shows the existing custom list items, while the text field below is used to enter a new item or modify an existing one.

Note: Before you edit the custom lists or their items, make sure you are familiar with the concept of segmentation rules. Segmentation rules affect how your document gets segmented. For the rules to take effect, you need to assign your segmentation rules before you import your documents.

Custom lists section:

Add: Click this button to add a custom list after entering its name in the text field below the list view.

Change: Click this button to modify a custom list after selecting it in the list view and editing it in the text field below.

Delete: Click this button to delete a custom list after selecting it in the list view.
 

List items section:

Add: Click this button to add a list item after entering it in the text field below the list view.

Change: Click this button to modify a list item after selecting it in the list view and editing it in the text field below.

Delete: Click this button to delete a list item after selecting it in the list view.

Preview: Click this button to open the Segmentation preview dialog, which shows how a few sample sentences are segmented if the current segmentation rules, which may use custom lists, are applied.

When you click a custom list, its items appear in the List items section. For example, for English:

#cap# contains all letters of the Latin alphabet, capitalized.

#end# contains all punctuation marks that can end a segment. If you want to segment after a semicolon, add the semicolon to the list items: enter the semicolon in the text field below the list of items, then click Add. memoQ will then also segment after a semicolon.

#abbr_long# contains all abbreviations for English with more than 2 characters. For instance, add "acc." to the list items, and memoQ will treat "acc." as an abbreviation rather than a segment end, and will not segment after it.

#abbr_short# works the same way as #abbr_long#.

#abbr_num_long# contains abbreviations such as Feb., Tel., and min. memoQ will not segment after these abbreviations when it detects them in the text. You can also add others, for instance Thur. or Fri., depending on your text type.

#lpar# contains all left/opening characters like (.

#rpar# contains all right/closing characters like ).

#roman# contains Roman numerals like II., VI., etc.

 

Example:

You have a document that contains titles like "Prof. Dr. Max Mustermann". Using the default segmentation rules, memoQ would not import it as one segment, but would split after "Prof.". You can easily fix this by adjusting the custom lists in your segmentation rules. Add "Prof." to the abbreviation list (#abbr_long#). You need to import your document again for the changed segmentation rules to take effect in your document. memoQ will then no longer split the "Prof. Dr. Max Mustermann" segment.
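
The logic of this example can be sketched in Python. The segment function below is a simplified illustration, not memoQ's algorithm, and the set default_list is an assumed stand-in for the contents of the default abbreviation list.

```python
import re

def segment(text, abbreviations):
    """Break after ., ! or ? followed by whitespace, unless the word
    ending there is a listed abbreviation (a segmentation exception)."""
    segments, start = [], 0
    for match in re.finditer(r"[.!?]\s+", text):
        words = text[start:match.start() + 1].split()
        candidate = words[-1] if words else ""
        if candidate in abbreviations:  # case-sensitive comparison
            continue  # exception applies: do not break here
        segments.append(text[start:match.start() + 1])
        start = match.end()
    if start < len(text):
        segments.append(text[start:])
    return segments

text = "The talk was given by Prof. Dr. Max Mustermann. It was well received."
default_list = {"Dr."}  # assumption: "Prof." is not in the list yet

before = segment(text, default_list)
after = segment(text, default_list | {"Prof."})
print(before)
print(after)
```

In the first call the text is split after "Prof."; once "Prof." is in the list, the title stays in one segment. In memoQ, the equivalent of calling the function again is importing the document again.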

Importing and exporting segmentation rules

memoQ supports the Segmentation Rules eXchange (SRX) standard, which enables the exchange of segmentation rules between different tools. However, memoQ's segmentation exception handling is more sophisticated than what SRX describes, so two export options are available: one for use in memoQ and one for use in other tools.

Import SRX: Click this button to import segmentation rules from an SRX file into the current rule set.

Export SRX: Click this button to export the current segmentation rule set into an SRX file.

Optimized export: Select this radio button to export segmentation rules for an SRX-enabled tool other than memoQ, and specify where you want to save the SRX file.

Detailed export: Select this radio button to export segmentation rules for another copy of memoQ, and specify where you want to save the SRX file.
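
If you want to inspect an SRX file outside memoQ, the rules can be read with any XML parser. The Python sketch below parses a small hand-written fragment in the style of SRX 2.0. The fragment is an assumed sample, not actual memoQ output, and the list_rules function is a hypothetical helper.

```python
import xml.etree.ElementTree as ET

# Hand-written sample in the style of SRX 2.0 (not exported from memoQ).
SRX_SAMPLE = r"""
<srx xmlns="http://www.lisa.org/srx20" version="2.0">
  <header segmentsubflows="yes" cascade="no"/>
  <body>
    <languagerules>
      <languagerule languagerulename="English">
        <rule break="no">
          <beforebreak>\bProf\.</beforebreak>
          <afterbreak>\s</afterbreak>
        </rule>
        <rule break="yes">
          <beforebreak>[.!?]</beforebreak>
          <afterbreak>\s</afterbreak>
        </rule>
      </languagerule>
    </languagerules>
    <maprules>
      <languagemap languagepattern="EN.*" languagerulename="English"/>
    </maprules>
  </body>
</srx>
"""

NS = {"srx": "http://www.lisa.org/srx20"}

def list_rules(srx_text):
    """Return (break, beforebreak, afterbreak) for every rule in the SRX text."""
    root = ET.fromstring(srx_text.strip())
    rules = []
    for rule in root.iterfind(".//srx:rule", NS):
        rules.append((
            rule.get("break", "yes"),  # a rule without the attribute is a break rule
            rule.findtext("srx:beforebreak", default="", namespaces=NS),
            rule.findtext("srx:afterbreak", default="", namespaces=NS),
        ))
    return rules

rules = list_rules(SRX_SAMPLE)
for kind, before_break, after_break in rules:
    print(kind, before_break, after_break)
```

Rules with break="no" are the exceptions, and rules with break="yes" are the break rules.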

Navigation

Click OK to save all changes and close the dialog. Click Cancel to close the dialog without saving changes.

 

See also:

Add abbreviation

Find abbreviation