简体   繁体   中英

remove un-wanted characters in email using preg-replace

I want to sanitize the email a#$%#$@b@#$#$2344324.com to a@b.com .
I tried and failed
echo filter_var("a#$%#$@b@#$#$2344324.com", FILTER_SANITIZE_EMAIL); //result: a#$%#$@b@#$#$2344324.com

I need to trim special characters in a email(sanitize to remove special characters). I used below code but I was unsuccessful.

$string = preg_replace("/^[a-zA-Z0-9._%+-]+@(?:[a-zA-Z0-9-]+\.)+[a-zA-Z]{2,4}$/", "", "a#$%#$@b@#$#$2344324.com");

echo $string;//result: a@b.com -- unwanted characters trimmed here.

There is already a RFC-based solution here: http://fightingforalostcause.net/misc/2006/compare-email-regex.php

function is_valid_email_address($email){

        $qtext = '[^\\x0d\\x22\\x5c\\x80-\\xff]';

        $dtext = '[^\\x0d\\x5b-\\x5d\\x80-\\xff]';

        $atom = '[^\\x00-\\x20\\x22\\x28\\x29\\x2c\\x2e\\x3a-\\x3c'.
            '\\x3e\\x40\\x5b-\\x5d\\x7f-\\xff]+';

        $quoted_pair = '\\x5c[\\x00-\\x7f]';

        $domain_literal = "\\x5b($dtext|$quoted_pair)*\\x5d";

        $quoted_string = "\\x22($qtext|$quoted_pair)*\\x22";

        $domain_ref = $atom;

        $sub_domain = "($domain_ref|$domain_literal)";

        $word = "($atom|$quoted_string)";

        $domain = "$sub_domain(\\x2e$sub_domain)*";

        $local_part = "$word(\\x2e$word)*";

        $addr_spec = "$local_part\\x40$domain";

        return preg_match("!^$addr_spec$!", $email) ? true : false;
    }

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM