0

I wrote a web page with one Russian word in it, using the character entity names for the Cyrillic letters (found in a list of all Unicode characters that I have). The page works fine in every browser I tried. But the W3C validator won't validate the page because the Cyrillic character entity names were used. It validates if I use the Unicode numbers instead, but that is such clunky programming. It reminds me of having to type in the ASCII code for every character a program displays in assembly language programming.

Why can't we have better programming practices? The W3C seems to want to deprecate all but 5 of the character entity names. This sounds like going back to the Stone Age.

3 Contributors · 14 Replies · 38 Views · 1 Year Discussion Span · Last Post by diafol
Featured Replies
  • Try wrapping the Cyrillic in <span lang='ru'>АБВГДЕЖЅ</span> [W3C: declaring language](https://www.w3.org/International/questions/qa-html-language-declarations#contentvsattribute)

  • 1
    diafol 3,669   1 Year Ago

    Try a proper editor. Even Notepad accepts Cyrillic characters.


0

Have you set encoding to utf-8?

Edit: Sorry, I'm not sure I understood properly. What is it you actually need help with?

Could you give examples?

<!doctype html>
<html lang="en">
<head>
  <meta charset="utf-8">
  <title>Page Title</title>
</head>
<body>
АВДИ
</body>
</html>

Validated OK for me.

Edited by diafol

0

Btw, the ???? above are A, B, D and a back-to-front N. Daniweb just ain't up to displaying them in messages.

0

I am using the doctype for xhtml. Does that make a difference?

Yes, I am using utf-8.

As an example, it did not recognize the character entity & YAcy; (the backward R) as valid.
It does recognize & #1071; (same character) as valid.
(I inserted a space after the & here to prevent encoding.)

Every browser I tried accepted & YAcy; and displayed the correct character.

Edited by MidiMagic: clarification

0

OK, any reason why you're using xhtml instead of html5?
Also, you should be able to insert the characters directly instead of using the HTML encodings. Having to encode every character would be ridiculous for anybody writing in anything other than English :)

0

OK, just read back the post again. You're using entity names (the ones starting with &...; like 'yacy'). Why are you using entity names instead of typing the actual characters directly? You can even use something as simple as charmap to do this.

BTW, both 'yacy' and 'YAcy' work for me.

<!doctype html>
<html lang="en">
<head>
  <meta charset="utf-8">
  <title>Page Title</title>
</head>
<body>
&yacy; &YAcy;
</body>
</html>

Edited by diafol

0

Aha. Yes, I get the error message when using XHTML or HTML4. Change to HTML5 if you can. I only just realised what it was that was causing the issue - doh! Does the document need to be validated as XML? If not, use HTML5.

Edited by diafol
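
For a page that has to stay XHTML, here is a minimal sketch of the numeric-reference workaround, assuming the XHTML 1.0 Strict doctype. The XHTML 1.0 DTDs only declare the Latin-1, symbol, and special entity sets, so Cyrillic names such as YAcy are not defined there, while HTML5 added them as named character references. That would explain why the same name passes under the HTML5 doctype and fails under XHTML. The title and the surrounding paragraph are placeholders, and the span with a lang attribute follows the language-declaration suggestion made earlier in the thread.

```html
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN"
  "http://www.w3.org/TR/xhtml1/DTD/xhtml1-strict.dtd">
<html xmlns="http://www.w3.org/1999/xhtml" xml:lang="en" lang="en">
<head>
  <meta http-equiv="Content-Type" content="text/html; charset=utf-8" />
  <title>Page Title</title>
</head>
<body>
  <!-- XHTML 1.0 Strict does not allow bare text in body, hence the p element. -->
  <!-- &#1071; (decimal) and &#x42F; (hex) both refer to U+042F, the letter that HTML5 calls YAcy. -->
  <p><span xml:lang="ru" lang="ru">&#1071; &#x42F;</span></p>
</body>
</html>
```

Numeric references are resolved by the parser without consulting the DTD, so they should behave the same under any doctype and any editor, including one that cannot display Cyrillic.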

0

My entire site is xhtml 1.0 strict as required by the webmaster for uniformity. It is the price I gladly pay for free hosting with unlimited storage.

I am still converting some of my old pages (not currently up) from html 4.0 (from when I had Geocities) to xhtml 1.0 so I can put them up (but doing it at my leisure, as new pages have my priority).

My native language is English. I do not have the ability to type in the character directly because I do not speak or write Russian. I put this one word in to explain a connection between the Bible and a historic event.

0

I get little white rectangles when I paste the copied text. My editor can't accept the Cyrillic characters.

1

@diafol:
His source/editor appears functional. The W3C validator picks up a Unicode fault that's hard to find, 'cause it's not really there.

Edited by almostbob

0

Ok, my last bit on this. Can't believe it's still on-going...

[Attached image: notepad.png]

Why you can't just use charmap in Windows I don't know. You are creating a problem where one doesn't exist.
