C++ HTML Encode Function

Question

campkev 0 Posting Pro in Training

18 Years Ago

looking for a function that will htmlencode a cstring

c c# c++ html-css

7 Contributors
10 Replies
3K Views
2 Weeks Discussion Span
Latest Post 18 Years Ago Latest Post by jwenting

All 10 Replies

John A 1,896 Vampirical Lurker

18 Years Ago

What do you mean by that? Simply put <p> tags around a string? If that's what you want, you can write your own very simply:

string htmlcode = "<p>"+paragraph+"</p>";

Likewise, searching and replacing newlines with <br/> is also very easy. Look up some of the string functions like string::find() and string::replace():
http://www.bgsu.edu/departments/compsci/docs/string.html

[edit]Or did you mean something like this... ;)[/edit]

Hope this helps

Dani 4,675 The Queen of DaniWeb

18 Years Ago

To HTML encode a string is to convert the " to " and < to < and > to > and & to & and you get the idea :) Essentially it's just a simple find and replace of those few things.

jwenting 1,905 duckman

18 Years Ago

To HTML encode a string is to convert the " to " and < to < and > to > and & to & and you get the idea :) Essentially it's just a simple find and replace of those few things.

hmm, except if the string represents PCDATA in which case it should be surrounded by <pre></pre> tags.

Reply to this topic

Be a part of the DaniWeb community

We're a friendly, industry-focused community of developers, IT pros, digital marketers, and technology enthusiasts meeting, networking, learning, and sharing knowledge.

campkev 0 Posting Pro in Training · Answer 1 · 2007-01-05T06:55:51+00:00

To HTML encode a string is to convert the " to " and < to < and > to > and & to & and you get the idea :) Essentially it's just a simple find and replace of those few things.

yeah, i know, was just looking for a function that was already written and handled all of them rather than doing them each individually. trying to do it the most laz.. I mean efficient way and use an existing function rather than write my own

WaltP 2,905 Posting Sage w/ dash of thyme Team Colleague · Answer 2 · 2007-01-05T08:41:55+00:00

...was just looking for a function that was already written and handled all of them rather than doing them each individually....

Nope, sorry. You have to be efficient on your own... ;)

campkev 0 Posting Pro in Training · Answer 3 · 2007-01-05T21:34:09+00:00

[edit]Or did you mean something like this... ;)[/edit]

that's what I am looking for, but a non-.NET version

Ravalon 62 Posting Whiz in Training · Answer 4 · 2007-01-05T22:36:09+00:00

that's what I am looking for, but a non-.NET version

I'm sure you could find something with google, but it's not hard to write your own either.

#include <algorithm>
#include <iostream>
#include <string>

#define array_length(array) (sizeof (array) / sizeof (array)[0])

namespace Raye {
  using namespace std;

  struct HTMLReplace {
    string match;
    string replace;
  } codes[] = {
    {"&", "&amp;"},
    {"<", "&lt;"}, 
    {">", "&gt;"}
  };

  string HTMLEncode( const string& s )
  {
    string rs = s;

    // Replace each matching token in turn
    for ( size_t i = 0; i < array_length( codes ); i++ ) {
      // Find the first match
      const string& match = codes[i].match;
      const string& repl = codes[i].replace;
      string::size_type start = rs.find_first_of( match );

      // Replace all matches
      while ( start != string::npos ) {
        rs.replace( start, match.size(), repl );
        // Be sure to jump forward by the replacement length
        start = rs.find_first_of( match, start + repl.size() );
      }
    }

    return rs;
  }
}

int main()
{
  using namespace std;

  cout << Raye::HTMLEncode( "template <class T> void foo( const string& bar );" ) << '\n';

  return 0;
}

Just add to the array when you want to handle another encoding, and be careful about encodings that are order sensitive. For example, the & encoding has to be done first because the others use & in the result. ;)

campkev 0 Posting Pro in Training · Answer 5 · 2007-01-05T22:42:46+00:00

I'm sure you could find something with google, but it's not hard to write your own either.

#include <algorithm>
#include <iostream>
#include <string>

#define array_length(array) (sizeof (array) / sizeof (array)[0])

namespace Raye {
  using namespace std;

  struct HTMLReplace {
    string match;
    string replace;
  } codes[] = {
    {"&", "&amp;"},
    {"<", "&lt;"}, 
    {">", "&gt;"}
  };

  string HTMLEncode( const string& s )
  {
    string rs = s;

    // Replace each matching token in turn
    for ( size_t i = 0; i < array_length( codes ); i++ ) {
      // Find the first match
      const string& match = codes[i].match;
      const string& repl = codes[i].replace;
      string::size_type start = rs.find_first_of( match );

      // Replace all matches
      while ( start != string::npos ) {
        rs.replace( start, match.size(), repl );
        // Be sure to jump forward by the replacement length
        start = rs.find_first_of( match, start + repl.size() );
      }
    }

    return rs;
  }
}

int main()
{
  using namespace std;

  cout << Raye::HTMLEncode( "template <class T> void foo( const string& bar );" ) << '\n';

  return 0;
}

Just add to the array when you want to handle another encoding, and be careful about encodings that are order sensitive. For example, the & encoding has to be done first because the others use & in the result. ;)

thanks, i'll try this.

King.Kong 0 Newbie Poster · Answer 6 · 2007-01-20T07:50:45+00:00

Not so fast. You have to take care of a long list of other entities too, like &pound; for £, etc. See here.

And if the & symbol is already part of an encoded literal, you don't want to encode it one more time.

King.Kong 0 Newbie Poster · Answer 7 · 2007-01-20T07:54:36+00:00

And all Unicode characters that cannot be represented within 8 bits have to be encoded too to &#xnnnn;

C++ HTML Encode Function

Recommended Answers Collapse Answers

All 10 Replies

Recommended Answers