Portuguese Wikipedia language issues

Revision as of 03:06, 19 July 2011 by Vapmachado (talk | contribs) (*wiki)
The ability to speak foreign languages

This page aims to identify the language issues that have been troubling the Portuguese Wikipedia and to find appropriate solutions to them.

Please use either the English or Portuguese sections of the talk page for debate.

Language

Languages, Wikipedias and Regulation

Portuguese

Portugal: Academia das Ciências de Lisboa
Brazil: Academia Brasileira de Letras

Language codes

Language                    ISO 639-1  ISO 639-2         ISO 639-3
English                     en         eng               eng
Czech                       cs         cze (B) ces (T)   ces
Slovak                      sk         slo (B) slk (T)   slk
Polish                      pl         pol               pol
Kashubian                   -          csb               csb
Silesian                    -          sla               szl
Upper Sorbian               -          hsb               hsb
Lower Sorbian               -          dsb               dsb
Old Church Slavonic         cu         chu               chu
Serbo-Croatian              -          -                 hbs
Serbian                     sr         srp               srp
Macedonian                  mk         mac (B) mkd (T)   mkd
Croatian                    hr         hrv               hrv
Slovene                     sl         slv               slv
Bosnian                     bs         bos               bos
Montenegrin                 -          sla               srp
Bulgarian                   bg         bul               bul
Russian                     ru         rus               rus
Belarusian                  be         bel               bel
Belarusian (Taraškievica)   be-tarask  ?                 ?
Ukrainian                   uk         ukr               ukr
Rusyn                       -          sla               rue

Portuguese

Language                    ISO 639-1  ISO 639-2         ISO 639-3
Portuguese                  pt         por               por

Language localisation

"Language localization (or localisation, in the UK and other Commonwealth countries) (from the English term locale, "a place where something happens or is set") is the second phase of a larger process of product translation and cultural adaptation (for specific countries, regions, or groups) to account for differences in distinct markets, a process known as internationalization and localization. Language localization is not merely a translation activity, because it involves a comprehensive study of the target culture in order to correctly adapt the product to local needs.

The localization process is most generally related to the cultural adaptation and translation of software, video games, and websites, and less frequently to any written translation (which may also involve cultural adaptation processes). Localization can be done for regions or countries where people speak different languages, or where the same language is spoken: for instance, [...], word choices and idioms vary among countries where English is the official language (e.g., in the United States, the United Kingdom, and the Philippines).

As the Localization Industry Standards Association (LISA) explains, globalization "can best be thought of as a cycle rather than a single process". To globalize is to plan the design and development methods for a product in advance, keeping in mind a multicultural audience, in order to avoid increased costs and quality problems, save time, and smooth the localizing effort for each region or country. Localization is an integral part of the overall process called globalization (What, [2010]).

There are two primary technical processes that comprise globalization: internationalization and localization.

[...]

The second phase, localization, refers to the actual adaptation of the product for a specific market. The localization phase involves, among other things, the four issues LISA describes as linguistic, physical, business and cultural, and technical issues.

At the end of each phase, testing, including quality assurance, is performed to ensure that [the] product works properly and meets the client's quality expectations.

Localization is often treated as a mere "high-tech translation", but this view does not capture its importance, its complexity or what it encompasses. Though it is sometimes difficult to draw the limits between translation and localization, in general localization addresses significant, non-textual components of products or services. In addition to translation (and, therefore, grammar and spelling issues that vary from place to place where the same language is spoken), the localization process might include adapting graphics; adopting local currencies; using proper forms for dates, addresses and phone numbers; the choices of colors; and many other details, including rethinking the physical structure of a product. All these changes aim to recognize local sensitivities, avoid conflict with local culture and habits, and enter the local market by merging into its needs and desires. For example, localization aims to offer country-specific websites of the same company, or different editions of a book depending on the place it is published.

Whereas localization is the process of adapting one product to a particular locale, globalization designs the product to minimize the extra work required for each localization.

Suppose someone is working for a company that, until now, has operated exclusively in [Portugal]. However, the company is now opening a major office in [Brazil], and needs a [Brazilian Portuguese]-language website. The company offers the same products and services in both countries, with only some minor differences, but perhaps some of the elements that appeared in the original website targeted at [Portugal] are offensive or upsetting in [Brazil] (use of flags, colors, nationalistic images, songs, etc.). Thus, that company might lose a potential market because of small details of presentation.

Furthermore, this company might need to adapt the product to its new buyers.

Now, suppose instead that this company has major offices in a dozen countries, and needs a specifically-designed website in each of these countries. Before deciding how to localize the website and the products offered in it in any given country, a professional in the area might advise the company to create an overall strategy: to globalize the way the organization does business. The company might want to design a framework to codify and support this global strategy. The globalization strategy and the globalization framework would provide uniform guidance for the 12 separate localization efforts."
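The split between a shared framework and per-locale adaptations described in the quotation can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the names SHARED, LOCAL and localize are hypothetical, and the catalogue holds just a handful of well-known vocabulary differences between Portugal and Brazil, not a real translation set.

```python
# Strings shared by every Portuguese locale (the "globalized" part).
SHARED = {"welcome": "Bem-vindo", "help": "Ajuda"}

# Per-locale overrides (the "localized" part). Illustrative data only.
LOCAL = {
    "pt-PT": {"bus": "autocarro", "train": "comboio", "currency": "EUR"},
    "pt-BR": {"bus": "ônibus", "train": "trem", "currency": "BRL"},
}


def localize(key, locale):
    """Return the locale-specific string, falling back to the shared one."""
    overrides = LOCAL.get(locale, {})
    if key in overrides:
        return overrides[key]
    if key in SHARED:
        return SHARED[key]
    raise KeyError(f"no string for {key!r} in locale {locale!r}")


if __name__ == "__main__":
    for locale in ("pt-PT", "pt-BR"):
        print(locale, localize("welcome", locale), localize("bus", locale))
```

Adding a further locale then means adding one entry to LOCAL, without touching the shared strings or the lookup function.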

Considering:

The following codes should be used:

pt Portuguese

pt-AO Angola

pt-BR Brasil (Brazil)

pt-CV Cabo Verde (Cape Verde)

pt-GQ Guiné Equatorial (Equatorial Guinea) - unconfirmed

pt-GW Guiné-Bissau (Guinea-Bissau)

pt-MO Macau

pt-MZ Moçambique (Mozambique)

pt-PT Portugal

pt-ST São Tomé e Príncipe

pt-TL Timor-Leste (East Timor)


Subtags

The syntax of language tags allows for:

  • an optional script subtag, composed of 4 letters only;
  • an optional region subtag, composed of 2 letters or 3 digits only;
  • optional variant subtags, each one composed of either:
    • 5 to 8 letters or digits, or
    • one digit followed by 3 letters or digits;
  • optional extension subtags, each one composed of:
    • a single digit or a single letter with the exception of letter x, used as a singleton,
    • a single hyphen, and
    • 2 to 8 letters or digits;
  • an optional private use subtag, composed of a single letter x, followed by one or more of:
    • a single hyphen, followed by
    • 1 to 8 letters or digits;
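The syntax listed above can be sketched as a regular expression. This is a simplified illustration, not a full RFC 5646 validator: it assumes a primary language subtag of 2 or 3 letters (not part of the list above), ignores extended language subtags and grandfathered tags, and matches variants as 5 to 8 letters or digits, which is what RFC 5646 permits. The names TAG_RE and parse_tag are hypothetical.

```python
import re

TAG_RE = re.compile(
    r"""
    ^(?P<language>[a-z]{2,3})                    # assumed: 2-3 letter language
    (?:-(?P<script>[a-z]{4}))?                   # script: 4 letters
    (?:-(?P<region>[a-z]{2}|[0-9]{3}))?          # region: 2 letters or 3 digits
    (?P<variants>(?:-(?:[a-z0-9]{5,8}|[0-9][a-z0-9]{3}))*)
    (?P<extensions>(?:-[0-9a-wy-z](?:-[a-z0-9]{2,8})+)*)
    (?:-(?P<private>x(?:-[a-z0-9]{1,8})+))?      # private use: x-...
    $
    """,
    re.IGNORECASE | re.VERBOSE,
)


def parse_tag(tag):
    """Split a language tag into its parts, or return None if it does not match."""
    match = TAG_RE.match(tag)
    if match is None:
        return None
    return {
        "language": match.group("language"),
        "script": match.group("script"),
        "region": match.group("region"),
        "variants": [v for v in match.group("variants").split("-") if v],
        "extensions": match.group("extensions").lstrip("-") or None,
        "private": match.group("private"),
    }


if __name__ == "__main__":
    for example in ("pt", "pt-BR", "pt-Latn-PT", "pt-BR-x-1943", "pt_BR"):
        print(example, parse_tag(example))
```

The pattern only checks the shape of a tag. Whether a given subtag is actually registered would still have to be checked against the Language Subtag Registry.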

"Subtags are not case sensitive, but the specification recommends using the same case as in the Language Subtag Registry, where region subtags are uppercase, script subtags are titlecase and all other subtags are lowercase. This capitalization follows the recommendations of the underlying ISO standards.

The use, interpretation (and matching) of language tags is currently defined in RFC 4647 (in combination with RFC 5646).

The variant subtags are not derived from any other standard (they are registered specifically in the IANA database according to the policy defined in the BCP 47 standard track)."
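The recommended capitalization can be applied mechanically. The sketch below follows the rule given in RFC 5646, section 2.1.1: every subtag is lowercase, except that two-letter subtags become uppercase and four-letter subtags become titlecase, unless they start the tag or follow a single-character subtag. The function name canonical_case is hypothetical, and the function does not check that the tag is well formed.

```python
def canonical_case(tag):
    """Return the tag with the capitalization recommended for language tags."""
    subtags = tag.split("-")
    result = [subtags[0].lower()]
    after_singleton = False  # True once an extension or private-use marker is seen
    for subtag in subtags[1:]:
        if after_singleton or len(subtag) == 1:
            after_singleton = True
            result.append(subtag.lower())
        elif len(subtag) == 4 and subtag.isalpha():
            result.append(subtag.capitalize())  # script, e.g. Latn
        elif len(subtag) == 2 and subtag.isalpha():
            result.append(subtag.upper())       # region, e.g. BR
        else:
            result.append(subtag.lower())
    return "-".join(result)


if __name__ == "__main__":
    for example in ("PT-br", "pt-latn-pt", "PT-BR-X-1943"):
        print(example, "->", canonical_case(example))
```

Because subtags are not case sensitive, this normalization changes only how a tag is written, not what it means.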

The IANA database of 2010-08-17 does not include any variant subtags for the Portuguese language.

"No extension subtags have yet been registered for now (they are reserved for future standardization)."

"The private use subtags are not registered in the IANA database as they are implementation-dependent and subject to private agreements between third-parties using them (these private agreements are out of scope of the BCP 47 standard track)."

From the IANA database of 2010-08-17, one may conclude that the year, carried in a private use subtag, could be an appropriate way to distinguish the different Portuguese language orthographies:

  • pt-BR-x-1943 Portuguese as spoken and written in Brazil with orthography of 1943
  • pt-BR-x-1971 Portuguese as spoken and written in Brazil with orthography of 1971
  • pt-BR-x-2009 Portuguese as spoken and written in Brazil with orthography of 2009
  • pt-PT-x-1911 Portuguese as spoken and written in Portugal with orthography of 1911
  • pt-PT-x-1945 Portuguese as spoken and written in Portugal with orthography of 1945
  • pt-PT-x-1973 Portuguese as spoken and written in Portugal with orthography of 1973
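A minimal sketch of how software might handle such tags follows. It assumes the private use convention proposed above (x followed by a four-digit year) and uses a fallback in the manner of the lookup scheme of RFC 4647, in which a tag is shortened from the right and a trailing single-character subtag is dropped along with the subtag that followed it. The function names are hypothetical, and tags are compared as written, so they should already be in a consistent case.

```python
def orthography_year(tag):
    """Return the year in an 'x-YYYY' private use subtag, or None."""
    subtags = tag.lower().split("-")
    if "x" in subtags:
        tail = subtags[subtags.index("x") + 1:]
        if len(tail) == 1 and len(tail[0]) == 4 and tail[0].isdigit():
            return int(tail[0])
    return None


def fallback_chain(tag):
    """Yield the tag, then progressively shorter tags, ending with the language."""
    subtags = tag.split("-")
    while subtags:
        yield "-".join(subtags)
        subtags.pop()
        # A single-character subtag such as "x" cannot end a tag.
        while subtags and len(subtags[-1]) == 1:
            subtags.pop()


def best_match(tag, available):
    """Return the first tag in the fallback chain that is available, or None."""
    for candidate in fallback_chain(tag):
        if candidate in available:
            return candidate
    return None


if __name__ == "__main__":
    print(orthography_year("pt-PT-x-1945"))
    print(list(fallback_chain("pt-BR-x-1943")))
    print(best_match("pt-BR-x-2009", {"pt", "pt-PT"}))
```

With this approach, content tagged for a specific orthography is still served to a reader who asked only for pt-BR or pt, and a request for an orthography that is not available degrades to the regional or generic variant.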

Differences

English

Slavic

Portuguese

Keyboard layouts

 
Keyboard layout for Portuguese (Portugal)


 
Keyboard layout for Portuguese (Brazil)


  • List of manufacturers planning to sell the same keyboard in both countries, with the date of the announcement:
    • yy/mm - Name of manufacturer (source)

My preferences

  • User profile > Internationalisation > Language:
    • pt - Português
    • pt-BR - Português do Brasil
Internationalisation
Language
Allows you to specify the language in which the site interface will be displayed to you. See Wikipedia:Meta:Help:System messages. There are some limitations:
  • If the wiki's sidebar contains hard-coded custom labels, these are in effect for all interface languages, and will not change according to this setting.
[...]
  • The interface language does not affect namespace names; they are determined by the site's main language. However, in links and in page names entered in the address bar of the browser, English namespace names, being the generic namespace names, are automatically converted to the local names.
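The conversion of generic namespace names to local ones can be pictured as a simple prefix substitution. The sketch below is an illustration only, not MediaWiki's implementation: the mapping LOCAL_NAMESPACES is a small assumed subset of namespace names for a Portuguese-language wiki, and localize_title is a hypothetical helper.

```python
# Assumed, illustrative subset: generic (English) name -> local name.
LOCAL_NAMESPACES = {
    "Talk": "Discussão",
    "Category": "Categoria",
    "Help": "Ajuda",
}


def localize_title(title):
    """Replace a generic namespace prefix with its local name, if there is one."""
    prefix, separator, rest = title.partition(":")
    local = LOCAL_NAMESPACES.get(prefix.strip().capitalize())
    if separator and local:
        return f"{local}:{rest}"
    return title  # no namespace prefix, or one with no local name in the table


if __name__ == "__main__":
    for example in ("Help:Contents", "category:Portugal", "Lisboa"):
        print(example, "->", localize_title(example))
```

The substitution depends only on the wiki's main language, which is why the interface language chosen in the preferences has no effect on it.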

System messages

Editing policy on the use of regional variants

Reference