It is based on Arabic script, but it has quite a few differences. There are a co...

marcosdumay · on Nov 17, 2019

> Since Iran is not a signatory to any international copyright treaties

Getting off from a tangent, how does that work? Your copyrights are ignored on any other country? Or do your people do something to get some kind of "international copyrights"?

I imagine it does not make much difference for patents, is that right?

smnrchrds · on Nov 17, 2019

> Your copyrights are ignored on any other country?

Essentially yes. If a work is produced outside of Iran, it does not have any copyright protection in Iran, vice versa.

As an example, since Harry Potter was quite popular in Iran, multiple (at least six IIRC) publishers translated it to Persian for the Iranian market. One publisher could not take another one's translated version and re-print it—the translated version was produced in Iran and enjoyed copyright protection in Iran. But the original English version was fair game for anyone.

smnrchrds · on Nov 17, 2019

Just in case it is not quite clear what Borna Rayaneh did to fonts to add Persian support: they essentially took Arabic fonts and wingdinged them until they looked Persian.

Also the sentence saying "But the font shows it as ك" should read "But the font shows it as ک".

apta · on Nov 18, 2019

> (ی vs ي)

Arabic has both, but they're pronounced differently from Farsi. (ي) is a (y) sound (like seed) whereas (ی) is either an (a) sound (like bat) or an (ay) sound (like may).

smnrchrds · on Nov 19, 2019

> Arabic has both

Not really. Arabic has U+0649 (Arabic Letter Alef Maksura), while Farsi has U+06CC (Arabic Letter Farsi Yeh). They look similar, even identical depending on the font, as long as they are standalone. When they are in a word though, it gets more complicated.

The important difference between U+0649 and U+06CC is how they look when they are connected to other letters. The former is always dotless. The latter is only dotless when it is not connected to another letter from the left. Here is an example:

U+0649 (Arabic): ى لى ىد لىد

U+06CC (Farsi): ی لی ید لید

It's kinda similar to how Turkish I's are not the same as English I's. English capital vs small form is different from the Turkish one, so different code points is necessary:

English: I i

Turkish (dotless): I ı

Turkish (dotted): İ i

Because Turkish uses separete letters for capital and small letters, only the different forms have their own codepoints. Because in Farsi and Arabic different forms of letters are implemented as ligatures, you need a different codepoint for each of them. You cannot reuse standalone U+0649 for U+06CC.

So to recap, Turkish has dotted İ and dotless I and they always retain their dot status. English has one I that will be written with or without a dot depending on how it is placed in the sentence.

Arabic has dotted ي and dotless ى and they always retain their dot status. Farsi has one ی that will be written with or without dots depending on how it is placed in the word.

apta · on Nov 20, 2019

Makes sense. Historically, all Arabic letters were dotless as you probably know. I wonder if this made it into Farsi script somehow, for this case at least.

solstice · on Nov 17, 2019

Thank you for the fascinating explanation and story