
Check if string contains only specific characters python

In Python, how to check if a string only contains certain characters?

I need to check that a string contains only a..z, 0..9, and . (period), and no other characters.

I could iterate over each character and check whether it is a..z, 0..9, or ., but that would be slow.

I am not clear on how to do it with a regular expression.

Is this correct? Can you suggest a simpler regular expression or a more efficient approach?

# Valid chars: . a-z 0-9
def check(test_str):
    import re
    # //docs.python.org/library/re.html
    # re.search returns None if no position in the string matches the pattern
    # Pattern to search for any character other than . a-z 0-9
    pattern = r'[^\.a-z0-9]'
    if re.search(pattern, test_str):
        # A character other than . a-z 0-9 was found
        print('Invalid : %r' % (test_str,))
    else:
        # No character other than . a-z 0-9 was found
        print('Valid   : %r' % (test_str,))

check(test_str='abcde.1')
check(test_str='abcde.1#')
check(test_str='ABCDE.12')
check(test_str='_-/>"!@#12345abcde

>>>
Valid   : "abcde.1"
Invalid : "abcde.1#"
Invalid : "ABCDE.12"
Invalid : "_-/>"!@#12345abcde

>>> reg.match('jsdlfjdsf12324..3432jsdflsdf')
True

but match() doesn't return True

[2] For use with match(), the ^ at the start of the pattern is redundant, and appears to be slightly slower than the same pattern without the ^
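Point [2] can be checked directly. The following is a minimal sketch, assuming Python 3; the sample strings are made up for illustration, and it only demonstrates that the two patterns agree, not the speed difference.

```python
import re

# match() only tries the pattern at position 0, so a leading ^ adds nothing.
with_caret = re.compile(r'^[a-z0-9.]+\Z')
without_caret = re.compile(r'[a-z0-9.]+\Z')

samples = ['abc.1', '#abc', 'abc#', '']
results = [(bool(with_caret.match(s)), bool(without_caret.match(s)))
           for s in samples]

for s, (a, b) in zip(samples, results):
    print(repr(s), a, b)
```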

[3] Should encourage using a raw string automatically, without thinking, for any re pattern

[4] The backslash in front of the dot/period is redundant
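To illustrate point [4]: inside a character class the dot is already a literal period, so the escaped and unescaped forms should behave the same. A small sketch, assuming Python 3; the variable names and sample strings are chosen here for illustration only.

```python
import re

escaped = re.compile(r'[^a-z0-9\.]')
plain = re.compile(r'[^a-z0-9.]')

# Both patterns flag exactly the same strings as containing a bad character.
for s in ['abc.1', 'abc#1', 'a b']:
    print(repr(s), bool(escaped.search(s)), bool(plain.search(s)))

# A dot inside [...] is not a wildcard: it does not match the letter 'x'.
dot_class_matches_x = re.search(r'[.]', 'x') is not None
print(dot_class_matches_x)
```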

[5] Slower than the OP's code!

prompt>rem OP's version -- NOTE: OP used raw string!

prompt>\python26\python -mtimeit -s"t='jsdlfjdsf12324..3432jsdflsdf';import re;reg=re.compile(r'[^a-z0-9\.]')" "not bool(reg.search(t))"
1000000 loops, best of 3: 1.43 usec per loop

prompt>rem OP's version w/o backslash

prompt>\python26\python -mtimeit -s"t='jsdlfjdsf12324..3432jsdflsdf';import re;reg=re.compile(r'[^a-z0-9.]')" "not bool(reg.search(t))"
1000000 loops, best of 3: 1.44 usec per loop

prompt>rem cleaned-up version of accepted answer

prompt>\python26\python -mtimeit -s"t='jsdlfjdsf12324..3432jsdflsdf';import re;reg=re.compile(r'[a-z0-9.]+\Z')" "bool(reg.match(t))"
100000 loops, best of 3: 2.07 usec per loop

prompt>rem accepted answer

prompt>\python26\python -mtimeit -s"t='jsdlfjdsf12324..3432jsdflsdf';import re;reg=re.compile('^[a-z0-9\.]+$')" "bool(reg.match(t))"
100000 loops, best of 3: 2.08 usec per loop
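The same comparison can be scripted with the timeit module instead of the command line. This is a sketch, assuming Python 3; the function names are hypothetical, and the absolute numbers depend on the machine and interpreter, so they will not match the Python 2.6 figures quoted here.

```python
import re
import timeit

t = 'jsdlfjdsf12324..3432jsdflsdf'
negated = re.compile(r'[^a-z0-9.]')     # OP's approach: look for one bad character
anchored = re.compile(r'[a-z0-9.]+\Z')  # cleaned-up version of the accepted answer

def via_search():
    return not negated.search(t)

def via_match():
    return bool(anchored.match(t))

# Time each approach over the same number of calls and print the totals.
for fn in (via_search, via_match):
    seconds = timeit.timeit(fn, number=100000)
    print(fn.__name__, seconds)
```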

[6] Can produce the wrong answer!!

>>> import re
>>> bool(re.compile('^[a-z0-9\.]+$').match('1234\n'))
True # uh-oh
>>> bool(re.compile('^[a-z0-9\.]+\Z').match('1234\n'))
False
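The wrong answer in [6] arises because $ also matches just before a trailing newline. On Python 3.4 and later, re.fullmatch sidesteps the choice of end anchor altogether. A sketch under that version assumption; only_allowed and _ALLOWED are names made up for this example, not part of any answer above.

```python
import re

_ALLOWED = re.compile(r'[a-z0-9.]+')

def only_allowed(s):
    """Return True if s is non-empty and consists solely of a-z, 0-9 and '.'."""
    # fullmatch succeeds only if the pattern consumes the entire string.
    return _ALLOWED.fullmatch(s) is not None

print(only_allowed('1234'))
print(only_allowed('1234\n'))
```

Note that, because of the +, this rejects the empty string, whereas the OP's negated-class search reports an empty string as valid.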

answered Aug 24, 2009 at 23:12

John Machin

Simpler approach? A little more Pythonic?

>>> ok = "0123456789abcdef"
>>> all(c in ok for c in "123456abc")
True
>>> all(c in ok for c in "hello world")
False

It certainly isn't the most efficient, but it's sure readable.
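The same idea can be pointed at the character set from the question and wrapped in a function. A sketch, assuming Python 3; ALLOWED and is_valid are names chosen here for illustration, and a frozenset is swapped in for the string so that each membership test is a hash lookup.

```python
import string

# a-z, 0-9 and the period, as specified in the question.
ALLOWED = frozenset(string.ascii_lowercase + string.digits + '.')

def is_valid(s):
    """Return True if every character of s is in ALLOWED."""
    # issuperset accepts any iterable, so the string can be passed directly.
    return ALLOWED.issuperset(s)

print(is_valid('abcde.1'))
print(is_valid('abcde.1#'))
```

As with the all() version, an empty string counts as valid here.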

answered Aug 24, 2009 at 16:26

Mark Rushakoff

EDIT: Changed the regular expression to exclude A-Z

The regular expression solution is the fastest pure-Python solution so far

>>> reg=re.compile('^[a-z0-9\.]+$')
>>> reg.match('jsdlfjdsf12324..3432jsdflsdf')
True
>>> timeit.Timer("reg.match('jsdlfjdsf12324..3432jsdflsdf')", "import re; reg=re.compile('^[a-z0-9\.]+$')").timeit()
0.70509696006774902

Compared to other solutions:

>>> timeit.Timer("set('jsdlfjdsf12324..3432jsdflsdf')


				
					

                 