Најди сличен содржај

Изворен канал @pythonotes · Post #65 · 8 апр.

Небольшой трик с регулярными выражениями который редко вижу в чужом коде. Допустим, вам нужно распарсить простой текст и вытащить оттуда пары имя+телефон. Вернуть всё это надо в виде списка словарей. Возьмем очень простой пример текста. >>> text = ''' >>> Alex:8999123456 >>> Mike:+799987654 >>> Oleg:+344456789 >>> ''' Соответственно, для выделения нужных элементов будем использовать группы. Получится такой паттерн: (\w+):([\d+]+) Как мы будем формировать словарь из найденных групп? >>> import re >>> results = [] >>> for match in re.finditer(r"(\w+):([\d+]+)", text): >>> results.append({ >>> "name": match.group(1), >>> "phone": match.group(2) >>> }) >>> print(results) [{'name': 'Alex', 'phone': '8999123456'}, ...] Можно немного сократить запись используя zip >>> results = [] >>> for match in re.finditer(r"(\w+):([\d+]+)", text): >>> results.append(dict(zip(['name', 'phone'], match.groups()))) Но есть способ лучше! Это именованные группы в regex. Можно в паттерне указать имя группы и результат сразу забрать в виде словаря. >>> for match in re.finditer(r"(?P<name>\w+):(?P<phone>[\d+]+)", text): >>> results.append(match.groupdict()) То есть всё что я сделал, это добавил в начале группы (внутри сбокочек) такую запись: (?P<group-name>...) Теперь найденная группа имеет имя и можно обратиться к ней как к элементу списка >>> name = match['name'] Либо забрать сразу весь словарь методом groupdict() >>> match.groupdict() #tricks#regex

Hashtags

#tricks #regex

Резултати

Пронајдени 1 слични објави

Пребарај: #watermarking

当前筛选 #watermarking清除筛选

AI & Law

@ai_and_law · Post #256 · 07.03.2024 г., 08:04

Најди слично Погледај

Mozilla Foundation Study Raises Concerns on Watermarking AI Content Hello everyone! In a study released by the Mozilla Foundation, the challenges of identifying synthetic content online have been brought to light. Titled "In Transparency We Trust? Evaluating Watermarking and Labeling AI-Generated Content," the study delves into the effectiveness of various methods, including watermarking and labeling, in differentiating between synthetic and authentic content. The study, which conducted a comprehensive assessment of seven methods, both machine-readable and human-facing, revealed alarming findings: none of the methods were rated as "good," indicating significant hurdles in accurately identifying synthetic content. Despite efforts to implement watermarking and labeling, the study underscores the persistent difficulties faced in combatting the proliferation of AI-generated content. #MozillaFoundation#AIContent#Watermarking

Hashtags

#mozillafoundation #aicontent #watermarking