Најди сличен содржај

Изворен канал @pythonotes · Post #239 · 3 мај

Один из самых удобных способов записать данные это использование готовых форматов, такие как JSON или YAML. Из плюсов такого подхода стоит отметить вот что: 🔸 готовый, повсеместно используемый и поддерживаемый формат 🔸 простой и понятный файл, удобочитаемый для человека 🔸 можно легко редактировать в любом текстовом редакторе без специальных программ и библиотек Но есть и минусы 🔹 затраты времени при записи файла (кодирование данных в нужный формат строки) 🔹 затраты времени при чтении файла (декодирование данных в Python объекты) 🔹 размер файла увеличивается из-за разметки данных (скобки, запятые, переносы, отступы...) 🔹 перед записью все данные должны быть помещены в память в полном объёме (не всегда) 🔹 при чтении необходимо считать весь файл в память и только потом декодировать данные Если нужно писать немного данных в несколько файлов, то затраты по времени не ощутимы. Обычно это файлы конфига или какие-либо метаданные. Это отличный вариант под такие задачи. Есть и другой поход к записи файлов - это бинарные файлы. Используется, когда данных достаточно много и никто их не собирается читать глазками😳. 🔸 очень быстрая запись 🔸 чтение значительно быстрей чем JSON, YAML итд 🔸 размер файла значительно меньше, так как нет разметки 🔸 можно записывать данные по мере поступления не загружая всё в память 🔸 можно извлечь любую часть данных независимо Из минусов 🔹 нужно определить свой формат записи данных (если не используете готовую спецификацию определённого формата) 🔹 не получится открыть файл и визуально понять что там записано, а для чтения файла потребуется знать его спецификацию. 🔹 не так-то просто создать такой файл без специальной библиотеки В таком виде удобно записывать большой массив любых однородных данных. Например, мониторинг валютной биржи или кэшированная анимация 3D геометрии. (Это не означает что нельзя записать данные разного типа, просто это будет не так удобно) Представьте себе JPG-картинку. По сути это немного мета-информации и большой массив пикселей. Тоже самое со звуком или видео файлом. Поэтому, если вы попробуете открыть картинку в текстовом редакторе вы увидите что-то вроде такого f15d cd29 a564 4578 ... 09e2 9bc4 a696 1253 ... 84e9 4de1 3b23 c24a ... 2534 5161 28e0 709d ... ... Это и есть записанные байтики. И для их чтения требуется определённый софт который знает что с ними делать. Под каждый тип файла. К чему это я? Читайте в следующем посте... #tricks#basic

Hashtags

#tricks #basic

Резултати

Пронајдени 5 слични објави

Пребарај: #datagovernance

当前筛选 #datagovernance清除筛选

AI & Law

@ai_and_law · Post #727 · 19.12.2025 г., 08:04

Најди слично Погледај

🇺🇸⚖️When Chat Logs Become Evidence After Death A new U.S. lawsuit alleges that OpenAI selectively withholds ChatGPT logs after a user’s death. The case concerns a murder–suicide involving Stein-Erik Soelberg, whose family claims ChatGPT reinforced paranoid delusions about his mother shortly before he killed her and himself. The estate argues that OpenAI refuses to disclose complete chat histories from the critical final days, despite relying on “full context” arguments in other suicide-related litigation. According to the complaint, fragments of chats recovered from social media show ChatGPT validating conspiracy beliefs, spiritual grandiosity, and hostility toward an identified individual. The family alleges a “pattern of concealment,” noting that OpenAI has no formal policy governing user data after death and retains chats indefinitely unless manually deleted. OpenAI has declined to explain why it will not produce the remaining logs, while stating publicly that it is improving safeguards and working with mental health clinicians. The lawsuit seeks punitive damages and an injunction requiring safeguards against validating paranoid delusions, as well as clearer public warnings about known risks. Beyond liability, the case raises unresolved questions about post-mortem data governance, evidentiary transparency, and the balance between user privacy and accountability when AI systems are implicated in real-world harm. #AIandLaw#AIEthics#DataGovernance

Hashtags

#aiandlaw #aiethics #datagovernance

AI & Law

@ai_and_law · Post #675 · 09.10.2025 г., 07:04

Најди слично Погледај

📖Generative AI Use Creates Massive Corporate Data Blind Spot A new Enterprise AI and SaaS Data Security Report 2025 by LayerX reveals that employees are regularly pasting sensitive information, including PII and PCI data, into generative AI tools like ChatGPT — often without company authorization. With 45% of enterprise employees using generative AI, 77% of them have copied and pasted data into chatbot queries, and 22% of those pastes involve sensitive personal or payment details. The security risk is compounded by the fact that 82% of these pastes come from unmanaged personal accounts, leaving companies with little to no visibility into what information is being shared. Similarly, 40% of file uploads to AI sites contain PII/PCI data, with nearly 39% originating from non-corporate accounts. This uncontrolled data flow creates a significant blind spot for enterprises, raising serious concerns about compliance, data governance, and legal accountability in AI usage. #AI#DataGovernance#Cybersecurity

Hashtags

#ai #datagovernance #cybersecurity

AI & Law

@ai_and_law · Post #252 · 01.03.2024 г., 08:04

Најди слично Погледај

OECD Establishes Expert Group for AI, Data, and Privacy Policy Synergies Hello everybody! The Organisation for Economic Co-operation and Development (OECD) has launched a new expert group aimed at fostering collaboration and coherence in artificial intelligence, data, and privacy policies. With a focus on finding common ground, the group seeks to bridge the gap between data governance and AI communities. By providing a platform for stakeholders to exchange insights and best practices, the OECD aims to address the complex intersection of AI, data, and privacy regulations. Through collective efforts, the group endeavors to develop guidelines and frameworks that promote responsible AI deployment while safeguarding individual privacy rights. #OECD#AI#DataGovernance#PrivacyPolicies

Hashtags

#oecd #ai #datagovernance #privacypolicies

AI & Law

@ai_and_law · Post #722 · 12.12.2025 г., 08:04

Најди слично Погледај

🇪🇺EU Opens Antitrust Probe into Google’s AI Training Practices The European Commission has launched an antitrust investigation into whether Google is using web content and YouTube uploads to train its AI systems without appropriate compensation, opt-out mechanisms, or equal access for competitors. Regulators are examining Google’s AI Overviews, AI Mode, and the use of YouTube content, noting that creators are required to grant Google permission for AI training without remuneration, while AI rivals are simultaneously blocked from using YouTube data for their own models. According to the Commission, the probe will assess whether Google imposes unfair terms on publishers and creators or grants itself privileged access to content in a way that may constitute abuse of dominance under EU competition rules. Google rejects the allegations, arguing that the inquiry risks slowing innovation and stating that tools like Google-Extended and robots.txt give publishers control, though the Commission noted concerns about the practical effects of blocking Google crawlers. T #AI#Antitrust#CompetitionLaw#DataGovernance#AIRegulation

Hashtags

#ai #antitrust #competitionlaw #datagovernance #airegulation

AI & Law

@ai_and_law · Post #341 · 28.06.2024 г., 07:04

Најди слично Погледај

Implementing Transparency in AI: A Step Forward Zuzanna Warso and Paul Keller from Open Future, alongside Maximilian Gahntz from Mozilla, have published a proposal to implement the EU AI Act’s training data transparency requirement for general-purpose AI (GPAI). Article 53 1(d) of the Act mandates GPAI model providers to publish detailed summaries of their training content, covering data sources and sets with narrative explanations. The proposed template emphasizes a comprehensive scope and sufficient technical detail to benefit both experts and laypeople. These summaries should list primary data collections, provide narrative explanations of other data sources, and clearly distinguish between 'data sources' (origins) and 'datasets' (processed data points). This transparency requirement aims to enhance accountability, enable research and scrutiny, and strengthen individuals' and organizations' ability to exercise their rights in the AI development process. #AI#Transparency#AIAct#DataGovernance#OpenFuture#Mozilla

Hashtags

#ai #transparency #aiact #datagovernance #openfuture #mozilla