Fixed Issue #8068 - SVG encoding by sughandj · Pull Request #8415 · matplotlib/matplotlib

sughandj · 2017-04-01T03:58:00Z

SVG backend now supports special characters like won symbol with usetex.
#8068

Added test encoding issue in SVG backend

tacaswell · 2017-04-02T01:37:46Z

Does #8286 also fix the same problem?

sughandj · 2017-04-17T09:33:34Z

@tacaswell Hi, sorry for the late reply.
I checked out #8286 and tried to reproduce #8068, the warning is not there however, the Won character doesn't show up.
Looks like #8286 doesn't solve #8068.

tacaswell · 2017-04-17T12:48:11Z

-                           for i, c in enumerate(enc0.encoding)}
+
+                    # Make a list of each glyph by splitting the encoding
+                    enc0_list = []


this is the same as enc0_list = [e.split('/') for e in enc0.encoding] ?

Why do this splitting?

No, that'll make a 2D list, but we need 1D.

The encoding that is generated looks like this ['Grave/Acute/Circumflex/Tilde/Dieresis/Hungarumlaut/Ring.....']
Thus splitting at "/" gives us individual character names.
Not only that, each index actually corresponds to its character code
(eg: enc0_list[142] = 'uni20A9' which is the Won character)
Therefore, line 363-364, enumerates the list with i = character code and c = character name and creates a dictionary character code => font index
Later, in the code the glyph is retrieved using this dictionary

if enc: charcode = enc.get(glyph, None)

Hope this makes sense :)

how has this ever worked if that is the case?

tacaswell · 2017-04-17T12:51:37Z

+                        enc0_list += e.split("/")
+
+                    # Encoding provided by the font file mapping names to index
+                    enc = {i: font.get_name_index(c) or None


This is in a block for when charmap_name == "ADOBE_STANDARD", why change to not use the standard encoding?

I think the fix should probably be fixed above to select a better character map for the file?

Yes, this is the block where if charmap_name == "ADOBE_STANDARD" and font_bunch.encoding:
Since font_bunch already has Unicode values in them, we don't need to specially use the adobe standard file.
Thus, it was removed completely.

Then the conditional should be changed, not just silently internally ignored.

tacaswell · 2017-04-17T12:53:38Z


                if charcode is not None:
-                    glyph0 = font.load_char(charcode, flags=ft2font_flag)
+                    if use_glyph:


This should probably be merged up into the conditionals above to simplify the logic?

The charcode is set right before we reach this condition if charcode is not None:
Therefore, it seems like the right spot to decide which font method to use to load the charcode.

your right, I missed that there was a path to get a non-empty enc that did not set use_glyph to True.

Actually, this is very problematic, the code path above where you set use_glyph is a caching mechanism so the second time around this may have the wrong value of use_glyph?

tacaswell · 2017-04-17T12:57:13Z

Can you explain this changes a bit better? Assume I know nothing about how font encoding works :)

Could you include the changes from #8286 in this PR (or explain why they are wrong!)?

sughandj · 2017-04-18T10:55:24Z

@tacaswell I hope the replies to your comments give you more insight of the changes :)
Let me know if you have any other questions.

tacaswell · 2017-04-18T12:44:37Z

The test failures look real.

I am still extremely uncomfortable with this change because I do not understand it yet.

This seems to be drastically changing how this code works (by consuming the encoding from the font rather than forcing it to use the adobe character map) but is still leaving a bunch of the old machinery around leaving the code in a very confused state.

anntzer · 2018-12-03T15:28:55Z

+                    # Make a list of each glyph by splitting the encoding
+                    enc0_list = []
+                    for e in enc0.encoding:
+                        enc0_list += e.split("/")


Note: one needs to do e.decode("ascii").split("/") to test this PR on Py3.

anntzer · 2019-07-05T19:38:00Z

I think this has been superseded by #12928 (which owes much to this PR, thanks :)).

sughandj and others added 3 commits March 31, 2017 22:36

Fixed encoding issue in SVG backend

6b4827a

Added test encoding issue in SVG backend

3fad38d

Merge pull request #2 from Sunakujira1/patch-3

0122faf

Added test encoding issue in SVG backend

tacaswell added this to the 2.1 (next point release) milestone Apr 2, 2017

Fix unicode won test

49ad267

sughandj added 2 commits April 17, 2017 05:51

catch empty won def in unicode won test

41e1047

use savefig of fig instead of plt

c9f84b6

tacaswell reviewed Apr 17, 2017

View reviewed changes

use fig methods only in unicode won test

cee6250

tacaswell modified the milestones: 2.1 (next point release), 2.2 (next next feature release) Aug 29, 2017

anntzer mentioned this pull request Dec 2, 2018

ENH: try to use unicode charmap before ADOBE_STANDARD in textpath #8286

Closed

anntzer reviewed Dec 3, 2018

View reviewed changes

anntzer mentioned this pull request Dec 3, 2018

textpath encoding #12928

Merged

6 tasks

anntzer closed this Jul 5, 2019

story645 removed this from the future releases milestone Oct 6, 2022

Uh oh!

Conversation

sughandj commented Apr 1, 2017

Uh oh!

tacaswell commented Apr 2, 2017

Uh oh!

sughandj commented Apr 17, 2017

Uh oh!

Choose a reason for hiding this comment

Uh oh!

sughandj Apr 18, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

sughandj Apr 18, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

tacaswell commented Apr 17, 2017

Uh oh!

sughandj commented Apr 18, 2017

Uh oh!

tacaswell commented Apr 18, 2017

Uh oh!

Choose a reason for hiding this comment

Uh oh!

anntzer commented Jul 5, 2019

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

sughandj Apr 18, 2017 •

edited

Loading

sughandj Apr 18, 2017 •

edited

Loading