Tuesday, June 8, 2010

Uncovering myths about Globalization testing- Approach to generate Localized test data- II

This post is in continuation of one of my previous posts on the subject of Localized test data generation. To recap the suggested approach, here it is-
- Come up with the appropriate classes of the data as per the structure of the language.
- Once the classes are identified, enture to pick up test data in a way that each class has a representation in the text data.

In the previous post, i gave an example about classes specific to Spanish language. I though to extend this further to include other European languages German and French as below.

French Language Input characters classes:
Capitals:
À,Â,Ä,È,É,Ê,Ë,Î,Ï,Ô,Œ,Ù,Û,Ü,Ÿ (commas are only used as separators)

Lower case:
à,â,ä,è,é,ê,ë,î,ï,ô,œ,ù,û,ü,ÿ

Punctuation
Ç,ç,«,»,€

Special French representations:
HTML entity codes (HTML entity codes are the codes which allow browsers and screen readers to process data as the appropriate language) e.g. for the character á, the HTML entity code is á
HTML entity codes for French language are listed here .

English Lower case characters:
a-z

English Upper case characters:
A-Z

Numeric representations:
1,2,3,4....

Special characters (EN):
~`!@#$%^&*()_+-={}[]|\:;"'<,>.?/

Any Known problematic French characters (not included above):

Some Real French Test data:
The purpose of this class is to simulate some of the real time test data as might be used by customers for some fields. Microsoft Terminology Translations offer a good source to simulate real strings that are used in Microsoft Terminologies.

**********************************************************************************************************

German Language Input characters classes:
Capitals:
Ä,Ö,Ü,ß,€ (commas are only used as separators)

Lower case:
ä,ö,ü

Punctuation
«,»,‹,›,„‚“‘”’–,—

Special German representations:
HTML entity codes (HTML entity codes are the codes which allow browsers and screen readers to process data as the appropriate language) e.g. for the character á, the HTML entity code is á
HTML entity codes for German language are listed here .

English Lower case characters:
a-z

English Upper case characters:
A-Z

Numeric representations:
1,2,3,4....

Special characters (EN):
~`!@#$%^&*()_+-={}[]|\:;"'<,>.?/

Any Known problematic German characters (not included above):

Some Real German Test data:
The purpose of this class is to simulate some of the real time test data as might be used by customers for some fields. Microsoft Terminology Translations offer a good source to simulate real strings that are used in Microsoft Terminologies.

Watch out this space further for Test data classes for Japanese, Chinese, Korean and other complex languages.

1 comment:

Anonymous said...

好文章給人的感覺就是很好,謝謝您~~ .................................................................