|
I have found a solution for case 1:
^((0[\,\.]000[1])|([1-9]\d{0,3}(?:[\,\.]\d{1,4}))|(?:10000(?:[\,\.]0{1,4})))?$
However, the expression is not as difficult as is said here.
At the special request of a single person, the regular expression now also accepts "." and ",".
I program in QT and QLineEdit can be configured very nicely and effectively with regex expressions. I would therefore be very reluctant to deviate from this.
I am programming an LCR measuring bridge front end from Analog Devices (ADMX2001). The tDelay value range between 0.0000 and 65536.0000 is expected by the frontend and is also checked. And the value range does not fit into a uint16. Should I now discuss the specified value range with Analog Devices ? They must know in which way the bridge operates.
modified 9-Feb-24 13:37pm.
|
|
|
|
|
Dagobert1 wrote: At the special request of a single person, the regular expression now also accepts "." and ",".
That isn't what they said.
There are many users in many places that represent decimal numbers using a different form.
So you either do not want to support them with your solution or you do.
|
|
|
|
|
Here is the solution for case 1:
(ui >= 0.0000) & (ui <= 10000.0000):
^((0([\,\.]\d{0,4})?)|([1-9]\d{0,3}(?:[\,\.]\d{1,4})?)|(?:10000(?:[\,\.]0{1,4})))?$
(ui >= 0.0001) & (ui <= 10000.0000):
^((0[\,\.]000[1])|(0[\,\.]\d{0,3}[1-9])|([1-9]\d{0,3}(?:[\,\.]\d{1,4})?)|(?:10000(?:[\,\.]0{1,4})))?$
Maybe the solution will help someone else.
|
|
|
|
|
Dagobert1 wrote: Maybe the solution will help someone else.
I have been using regular expressions extensively for decades and the way I would solve the problem was already suggested in a previous post.
Parse the number into a floating point value and then validate it that way.
Even when I have needed to provide a configurable validation I have designed it that way.
It is not only less complex it is also going to be faster.
|
|
|
|
|
Now try any remotely-complicated range - eg: ui ≥ -19.4242 && ui ≤ 1337.4242 - and see how "easy" that is with a regex.
Parsing the value as an appropriate type and then checking the range is far simpler and faster. And as has already been pointed out, it will handle culture-specific formatting much better.
"These people looked deep within my soul and assigned me a number based on the order in which I joined."
- Homer
|
|
|
|
|
If all you are doing is trying to validate (in a QLineEdit) that a floating point number is in a particular range, why don't you use a QDoubleValidator with it? This allows you to set range values[^].
|
|
|
|
|
I have two scenarios in table one is name of person and another is cityname but both name of the person and cityname starts with capital letter and followed by small letters
I want different regular expression for cityname and name of person but both starts with capital letter and followed by small letters how can I differentiate them with regular expression.
|
|
|
|
|
This makes no sense at all. What you've said is you have Person names and City names, both using the sexact same format, a capital letter followed by lower-case letters. There is no way to "differentiate them", whatever that means, with a single RegEx.
You're going to have to do a better job of explaining what the data you're dealing with is like, and a much better explanation of what you mean by "differentiate them."
|
|
|
|
|
I am working with Data masking in test data management there we are using one tool called javelin workflow to extract data from XML which is inserted into database and
To read data we are using regular expression pattern for name and cityname.
But our requirement is city name like Birmingham should be masked as fixed value as Norwich and name of person should be masked as random letters, but we are using same regular expression pattern for both and data is not masking as we expected so we want to differentiate them with regular expression
|
|
|
|
|
If this is coming from an XML file, you SHOULD have fields in the XML specific to each type of name. If not, you have very badly malformed data in the file making it pretty much useless, unless there is another field in the same record telling you what type of name is in the record. Without that discriminator field, the data you're pulling from the XML is useless.
You cannot use a RegEx to distinguish between a person name and a city name in the same field. It's just not possible, even for a human to determine by hand.
|
|
|
|
|
XML element looks like for name of person
<ab ov="Jeff" v="Jeff" id="1">
And in the same XML for cityname
<ab ov="Birmingham" v="Birmingham" id="2">
And in same XML cityname having all caps letter like
<ab ov="BIRMINGHAM" v="BIRMINGHAM" id="3">
So suggest me regular expression for Cityname for both cases Birmingham and BIRMINGHAM because I want to proceed name for different function and cityname for different function in data masking tool.
And regular expression for name of person like "Jeff"
modified 8-Feb-24 11:29am.
|
|
|
|
|
You really are determined to ignore what you're being told, aren't you?!
It is literally impossible to use a regex to determine whether a sequence of characters refers to a person, a city, a talking kangaroo, or a Vulcan mating ritual.
Your only hope is to get a massive database containing the name of every single city on Earth, and hope that anything that's not in that list refers to a person. And even that's not foolproof - for example, "Paris" could be a city or a person. Without more details, you have no way of knowing.
The data you're trying to process is garbage. If you need to be able to tell whether the data refers to a person or a city, then you need to go back to the people providing the data and get them to add something into the data to distinguish the two. Assuming they know the difference in the first place!
"These people looked deep within my soul and assigned me a number based on the order in which I joined."
- Homer
|
|
|
|
|
|
|
Richard Deeming wrote: or a Vulcan mating ritual.
Are you sure? There probably could be more than one but I suspect the names are going to be pretty unique.
|
|
|
|
|
It's simply not possible. There is no expression that will be able to tell you whether the name is a person or a city. NONE AT ALL.
If you're trying to extract the names from the XML file, you DO NOT USE A REGEX FOR THIS. You create classes to hold each type of record and deserialize the XML into a data structure using those classes.
But, since you're get both city names and person names in the same record type (whatever "ab" means), there is no code you could ever write to tell you whether that is a person or a city.
|
|
|
|
|
Based on your description, it would seem that New York is not a valid name for either a person or a city. I'm pretty sure it is a city. So is Stoke-on-Trent . There's probably other names for both people and cities that don't fit your expected pattern.
Consider, is Regina a person or a city? I know people named Regina. I know of a city named Regina. How would you differentiate between the two?
I don't think that a regex is the right tool for this. I'm pretty sure both person and city names are far more complex than you've allowed for.
"A little song, a little dance, a little seltzer down your pants"
Chuckles the clown
|
|
|
|
|
It's impossible. Suppose you are just looking at surname and city, then my local city defeats this. Am I looking for the magician with the surname Durham[^], or the city in England[^]?
|
|
|
|
|
KiranKumar V 2024 wrote: regular expression for cityname and name of person
You stated in the other post
Go to ParentXML element looks like for name of person
<ab ov="Jeff" v="Jeff" id="1">
And in the same XML for cityname
<ab ov="Birmingham" v="Birmingham" id="2">
And in same XML cityname having all caps letter like
<ab ov="BIRMINGHAM" v="BIRMINGHAM" id="3">
As suggestion from another response it is NOT possible for you to determine from the above which is a city and which is a persons name.
HOWEVER, what you posted is not valid XML. It would seem possible to me that there are other XML elements that you can use.
But if not then I would immediately point out to whoever assigned this to you that it is NOT deterministic. A computer can NOT solve the problem correctly. Doesn't matter how you do it.
But with you posted the ONLY solution you have right now would be with the following.
- You must buy a city database. That is a product/service that one pays money for.
- You then use XML to parse the data. You do NOT use regular expressions to parse it.
- You look up the each value in the database. If you find it is a city. If you don't it is a name.
Following is an actual list of cities named after people. So of course these are the one that a computer cannot tell the difference. Actually human will not be able to tell it either.
https://en.wikipedia.org/wiki/List_of_places_in_the_United_States_named_after_people
Now in terms of other possibilities.
- There is in fact a person name AND city name in each record. So you could use that in combination with the above.
- As I said there are other elements/attributes in the XML that define exactly what it is.
- You can request that they change the XML to make it clear which is a city and which is a name.
|
|
|
|
|
jschell wrote: If you find it is a city. If you don't it is a name.
Paris[^]? Durham[^]? London[^]? Adelaide[^]? Etc.
There are plenty of examples that could be either a city or a name. Using not in the list of cities === person test might give you a good start, but its never going to be 100% accurate.
"These people looked deep within my soul and assigned me a number based on the order in which I joined."
- Homer
|
|
|
|
|
Richard Deeming wrote: but its never going to be 100% accurate.
Was the phrasing in my response not clear? I thought I was pointing that out in several places.
|
|
|
|
|
I am trying to create a filter for AH01264 errors. This is for bots trying to run standard "php"s or "pl"s off of my home server.
While I have figured out how enter and activate new filters in fail2ban, the regex required is way beyond my capabilities.
So was hoping someone here has done this before and could help me out.
Thank you
|
|
|
|
|
This forum is for regex in general. So specifics to fail2ban are not likely to succeed.
Googling I found other posts about this although I did not look into it deeply.
fail2ban ah01264
Based on that seems possible at least.
But in general "error messages" across all applications are seldom well regulated. Thus attempts to capture them either lead to too few or too many. Until one has enough actual examples to code to. Even then updates might change that.
For this forum you can post a regex and examples and we can correct the regex from that.
|
|
|
|
|
Good morning
I need to "purify" sentences to be able to use them in an app.
Thanks a lot for your help in helping me build the right code
I think the pattern is:
- 'any sentence' (0 or 1 time) {text here} (0-N times)
example
AB CD {xxxxx}
or AB CD {xxxxx,yyyy}
or {xxxxx}
or AB CD
or AB CD {xxxxx} AB CD {xxxxx}
or AB CD {xxxxx} {xxxxx} {xxxxx}
etc
- the {text here} block looks like {digit|some text}.
For example:
{1|xxxxxxxxx}
- the 'some text' block can (not mandatory) contain 'default=xxx' at any place in the text
ex: {3|abc=d,default=my value} or {2|a b c=d,default=my value,another=valueThatIDontNeed} or {1|default=my value}
I need to isolate the following parts and return them to a string.
- 'any sentence' text (if exists)
- xxx of the 'default=xxx' pattern, as per above explanation
This does not need to be done in one pass, I can script that in loops in Python for example.
Here are a few examples
Example 1
Store bulk masses greater than {0|message=<specify mass="" value="">|filter=^(_)?MASS_VALUE.+|add space after=false.+}{1|message=<specify mass="" unit="">|filter=^(_)?P413_MASS_UNIT.+} at temperatures not exceeding {2|message=<specify temperature="" value="">|filter=^(_)?TEMP_VALUE_.+|add space after=false.+}{3|message=<specify temperature="" unit="">|filter=^(_)?P413_TEMP_UNIT_.+}
this should give
Store bulk masses greater than at temperatures not exceeding
Example 2
Inhoud onder {0|message=<geschikt(e) vloeistof="" of="" gas="" specificeren="">|default=inert gas|filter=^(_)?P231_STORAGE_.+} gebruiken en bewaren. Tegen vocht beschermen.
Should give
Inhoud onder inert gas gebruiken en bewaren. Tegen vocht beschermen.
Example 3
EN CAS DE CONTACT AVEC LA PEAU: Laver abondamment{0|message=<préciser un="" produit="" de="" nettoyage="">|default=à l’eau|filter=^(_)?P352_WASH_.+}. Appeler immédiatement {1|message=<préciser qui="" pourra="" émettre="" comme="" il="" convient="" n="" avis="" médical="" en="" cas="" d’urgence="">|default=un CENTRE ANTIPOISON ou un médecin|filter=^(_)?P310_EMERGENCY_.+}.
Should give
EN CAS DE CONTACT AVEC LA PEAU: Laver abondamment à l’eau . Appeler immédiatement un CENTRE ANTIPOISON ou un médecin
Example 4
{0|message=<specificeren of="" dumpingvoorschriften="" van="" toepassing="" zijn="" op="" inhoud,="" container="">|default=Inhoud/verpakking|filter=^(_)?P501_REQUIREMENT_.+} afvoeren naar {1|message=<specificeer welke="" lokale="" regionale="" nationale="" internationale="" wetgeving="">|default=…|filter=^(_)?P501_DISPOSAL_.+}.
Should give
Inhoud/verpakking afvoeren naar … .
thanks !
|
|
|
|
|
I've been able to isolate text vs {} blocks in RegExr: Learn, Build, & Test RegEx[^] and Regex Tester and Debugger Online - Javascript, PCRE, PHP[^]
using
((?![{}])\w| )*|(({.*?}))
I'd then expert to use a python or php script to loop on groups , and for each group launch another regex to grep the "default=xxx" text only.
but using the same regex in python does not work (
The regex does not isolate block the same way as the 2 websites do
Looking for a bit of help here
Thanks so much
|
|
|
|