Crab - A Python Framework for Building Recommendation SystemsMarcel Caraciolo
Crab is a Python framework for building recommendation engines. It began as a Mahout alternative for Python developers and is being rewritten as a Scikit-learn submodule. Crab currently features collaborative filtering algorithms and evaluation metrics. Developers are working in sprints to optimize performance by integrating Numpy and migrating Crab to work as a Scikit in order to make it faster and more accessible to the scientific community.
Neste tutorial apresentei usando Python Básico conceitos de como construir um sistema de recomendação por filtragem colaborativa.
Mutirão PyCursos:
Vídeo em: https://plus.google.com/u/0/events/c3hqbk20omt3r5uoq13gpk82i9g
The document describes Doonish, an online trivia game inspired by Trivial Pursuit. Players can create and answer questions in different categories. The game was first created in 2007 and redesigned in 2010. It is developed by a team of 4 programmers using technologies like a naive Bayes text classifier, tagging engine, and Wikipedia text integration to enhance the questions. The goal is to build the game using known technologies to minimize learning time.
Crab: A Python Framework for Building Recommender Systems Marcel Caraciolo
Crab is a Python framework for building recommendation engines. It began as a community-driven project one year ago and was incorporated into the open-source labs Muriçoca in April 2011. Crab is being rewritten as a Scikit (toolkit for machine learning in Python) to take advantage of the Scikit-Learn algorithms and infrastructure. The current version of Crab implements collaborative filtering algorithms like user-based, item-based, and matrix factorization and can evaluate recommender algorithms with metrics like precision, recall, and RMSE. It also provides APIs to build recommendation systems and deploy them using REST frameworks. Crab is already used in some production recommender systems.
Content Recommendation Based on Data Mining in Adaptive Social NetworksMarcel Caraciolo
The document discusses content recommendation in adaptive social networks based on data mining. It aims to design a methodology for social recommender systems that incorporate different knowledge sources from structured and unstructured data. The objectives are to design improved explanations for recommendations to increase user acceptance and enhance the student experience. The approach uses a hybrid recommender system that adapts the weighting of collaborative and content-based filtering based on the type of content being recommended. Current results show the system integrated into a Brazilian social network with over 70,000 students and items, with early user feedback being positive. Expected results include analyzing how recommendations can improve the learning process and exploring hidden knowledge in social networks.
Palestra sobre Computação Científica com Python, Scipy e Numpy ministrada durante o XVI Encontro do Grupo de Usuários de Python de Pernambuco, Recife - Pernambuco - 03/09/2011 por Marcel Pinheiro Caraciolo
Recommender Systems with Ruby (adding machine learning, statistics, etc)Marcel Caraciolo
This document discusses the use of Ruby for recommendation systems and related tasks like data analysis and visualization. It provides examples of how Ruby libraries and tools like Recommendable, NMatrix, BioRuby, and RubyDoop can be used for tasks like collaborative filtering, content-based recommendations, machine learning, scientific computing, and processing large datasets. The document also discusses some common challenges for recommendation systems and how different approaches like content-based and collaborative filtering attempt to address them.
King crab, also known as red king crab or giant Kamchatka crab, is native to seas in the northern Far East region. The document summarizes the nutritional value and chemical composition of king crab meat per 100 grams. It also provides details about a Ukrainian company that produces and exports canned king crab products from raw crab caught off the coast of Kamchatka and processed in facilities near Odessa, Ukraine. The production meets food safety standards and the company offers a variety of king crab products for sale.
Sistemas de Recomendação e Inteligência ColetivaMarcel Caraciolo
Marcel Pinheiro Caraciolo discute sistemas de recomendação, incluindo como eles funcionam para personalizar a experiência de compra do usuário, como filtragem colaborativa, de conteúdo e híbrida, e como as recomendações são avaliadas e apresentadas aos usuários.
1. Hermit crabs do not have an exoskeleton and instead live inside empty snail and whelk shells for protection and shelter.
2. They are commonly found in rocky shores where there is ample food and empty shells.
3. Hermit crabs have many predators including seagulls, pelicans, octopuses, other crabs, humans, and seals.
lobsters and crab fisheries in INDIA is a vast and enormous amount of catch and exports are being made.
this slide describes about the methods, distribution, annual landings and important species of lobster and crabs in India.
1. Mud crabs, commonly known as Scylla serrata and Scylla tranquebarica, are found along the coasts of India, particularly in Andhra Pradesh, Tamil Nadu, and Kerala. Mud crab farming is an important industry in several Southeast Asian countries.
2. The document provides details on mud crab habitat and feeding habits, reproduction, and different farming techniques such as pond culture, pen culture, and cage culture. It also discusses economic considerations of mud crab culture and fattening.
3. Mud crab farming can be a profitable activity, with net profits of over Rs. 1 lakh possible per crop using various culture methods over 4-7 months.
Brood stock management and larval rearing of mud crab scylla serrata-Gayatri ...Gayatri R. Kachh
This document provides information about the mud crab Scylla serrata, including its natural range, classification, life stages, and aquaculture practices. Key points include:
- S. serrata is an economically important crab species found in mangroves and estuaries in Africa, Australia, and Asia.
- Its life stages include juvenile, subadult, and adult crabs that inhabit different zones, as well as larvae and megalopae.
- Aquaculture of S. serrata involves maintaining broodstock for breeding and larval rearing, then culturing megalopae through to market size in ponds. Proper water quality, feeding, and health management are
This document summarizes the methods developed by SEAFDEC Aquaculture Department for large-scale production of juvenile mud crabs (Scylla spp.). The key steps include:
1) Breeding pond-grown female mud crabs and examining their ovaries for maturity. Mature females are held individually and spawn eggs attached to their pleopods.
2) Larval rearing of hatched zoea in concrete tanks with rotifers and Artemia as food at specific stocking densities and water conditions.
3) Nursery of megalopa in concrete tanks or net cages in brackishwater ponds, with natural pond food and feeding of minced fish/mussel
Mud crabs, also known as mangrove crabs, occur widely in estuaries and along tropical, subtropical and warm temperate coasts in the world. There are four species of mud crab (Family: Portunidae), Scylla serrata, S. tranquebarica, S. paramamosain and S. olivacea that are the focus of both commercial fisheries and aquaculture production throughout their distribution. They are among the most valuable crab species in the world, with the bulk of their commercial production sent live to market.
The document provides information about the mud crab (Scylla serrata), including its classification, distribution, ecology, and methods for farming and hatchery management. It notes that mud crabs are found naturally in the Indo-Pacific region and are an economically important species. Mud crab farming can be done using grow-out systems, where young crabs are raised for 5-6 months until market size, or fattening systems, where soft-shelled crabs are raised for a shorter period. Hatchery management involves broodstock selection and care, spawning and larval rearing techniques, and achieving survival rates as high as 18.1% compared to the world average of 3-3.5
The document discusses breeding and seed production techniques for various aquaculture species in Southeast Asia. It covers the life cycles, sexual maturity sizes, spawning seasons and methods, larval rearing protocols, and hatchery management practices for marine fish, tilapia, crustaceans, and abalone. Constraints to sustainable aquaculture development in the region include the availability of technology, seed supply, suitable feeds, disease management, and trained personnel.
Crabs are arthropods that have 10 legs, a hard shell, and walk sideways. There are almost 5,000 species of crabs, with 4,500 being true crabs and 500 being hermit crabs that steal shells from other animals for protection. The largest crab is the Japanese spider crab, with a leg span of up to 3.7 meters, while the largest land crab is the coconut crab with a leg span of up to 75 cm. Crabs are found on beaches around the world and come in various colors depending on the species.
Summary of the achievements of the four engineering teams that forms Core Development Group at Linaro during the one week long Linaro Connect USA 2014 (LCU14)
Advancing Reinaldo Gonsalves’ Model of Global Economic InsertionIan Walcott-Skinner
This paper is located in what is referred to as policy critique within the theoretical framework of International Political Economy (IPE) which, by origin, seeks to problematize issues of policy. In 1994, celebrated Brazilian economist, Reinaldo Gonsalves produced an important thesis and model on how to measure a country’s global insertion. At that time, Gonsalves could not have foreseen the influence of the Internet on global trade or on domestic trade policies. As such, the issue of global digital connectivity now presents itself as another pillar to measure global insertion. By examining regional Caribbean policy in this regard, this is an opportunity to advance Gonsalves’ model stimulate further on the opportunities associated with global digital connectivity.
The document discusses the Generic Connection Framework (GCF) in Java ME, which defines interfaces for network connections on CLDC platforms. GCF includes interfaces for stream-based, datagram-based, and content-based connections. Example implementations are provided for HTTP, TCP sockets, UDP datagrams, serial communications, and more.
Marcel Caraciolo is a scientist and CTO who has worked with Python for 7 years. He is interested in machine learning, mobile education, and data. He is the current president of the Python Brazil Association. Caraciolo has created several scientific Python packages and taught Python online. He is now working on applying Python to bioinformatics and clinical sequencing through tools like biopandas.
A talk to introduce Singularity Registry HPC, which allows you to install Singularity, Podman, or Docker containers (and others) as modules on an HPC system (e.g., LMOD or environment modules). Presented 2021.
Talk given at first OmniSci user conference where I discuss cooperating with open-source communities to ensure you get useful answers quickly from your data. I get a chance to introduce OpenTeams in this talk as well and discuss how it can help companies cooperate with communities.
King crab, also known as red king crab or giant Kamchatka crab, is native to seas in the northern Far East region. The document summarizes the nutritional value and chemical composition of king crab meat per 100 grams. It also provides details about a Ukrainian company that produces and exports canned king crab products from raw crab caught off the coast of Kamchatka and processed in facilities near Odessa, Ukraine. The production meets food safety standards and the company offers a variety of king crab products for sale.
Sistemas de Recomendação e Inteligência ColetivaMarcel Caraciolo
Marcel Pinheiro Caraciolo discute sistemas de recomendação, incluindo como eles funcionam para personalizar a experiência de compra do usuário, como filtragem colaborativa, de conteúdo e híbrida, e como as recomendações são avaliadas e apresentadas aos usuários.
1. Hermit crabs do not have an exoskeleton and instead live inside empty snail and whelk shells for protection and shelter.
2. They are commonly found in rocky shores where there is ample food and empty shells.
3. Hermit crabs have many predators including seagulls, pelicans, octopuses, other crabs, humans, and seals.
lobsters and crab fisheries in INDIA is a vast and enormous amount of catch and exports are being made.
this slide describes about the methods, distribution, annual landings and important species of lobster and crabs in India.
1. Mud crabs, commonly known as Scylla serrata and Scylla tranquebarica, are found along the coasts of India, particularly in Andhra Pradesh, Tamil Nadu, and Kerala. Mud crab farming is an important industry in several Southeast Asian countries.
2. The document provides details on mud crab habitat and feeding habits, reproduction, and different farming techniques such as pond culture, pen culture, and cage culture. It also discusses economic considerations of mud crab culture and fattening.
3. Mud crab farming can be a profitable activity, with net profits of over Rs. 1 lakh possible per crop using various culture methods over 4-7 months.
Brood stock management and larval rearing of mud crab scylla serrata-Gayatri ...Gayatri R. Kachh
This document provides information about the mud crab Scylla serrata, including its natural range, classification, life stages, and aquaculture practices. Key points include:
- S. serrata is an economically important crab species found in mangroves and estuaries in Africa, Australia, and Asia.
- Its life stages include juvenile, subadult, and adult crabs that inhabit different zones, as well as larvae and megalopae.
- Aquaculture of S. serrata involves maintaining broodstock for breeding and larval rearing, then culturing megalopae through to market size in ponds. Proper water quality, feeding, and health management are
This document summarizes the methods developed by SEAFDEC Aquaculture Department for large-scale production of juvenile mud crabs (Scylla spp.). The key steps include:
1) Breeding pond-grown female mud crabs and examining their ovaries for maturity. Mature females are held individually and spawn eggs attached to their pleopods.
2) Larval rearing of hatched zoea in concrete tanks with rotifers and Artemia as food at specific stocking densities and water conditions.
3) Nursery of megalopa in concrete tanks or net cages in brackishwater ponds, with natural pond food and feeding of minced fish/mussel
Mud crabs, also known as mangrove crabs, occur widely in estuaries and along tropical, subtropical and warm temperate coasts in the world. There are four species of mud crab (Family: Portunidae), Scylla serrata, S. tranquebarica, S. paramamosain and S. olivacea that are the focus of both commercial fisheries and aquaculture production throughout their distribution. They are among the most valuable crab species in the world, with the bulk of their commercial production sent live to market.
The document provides information about the mud crab (Scylla serrata), including its classification, distribution, ecology, and methods for farming and hatchery management. It notes that mud crabs are found naturally in the Indo-Pacific region and are an economically important species. Mud crab farming can be done using grow-out systems, where young crabs are raised for 5-6 months until market size, or fattening systems, where soft-shelled crabs are raised for a shorter period. Hatchery management involves broodstock selection and care, spawning and larval rearing techniques, and achieving survival rates as high as 18.1% compared to the world average of 3-3.5
The document discusses breeding and seed production techniques for various aquaculture species in Southeast Asia. It covers the life cycles, sexual maturity sizes, spawning seasons and methods, larval rearing protocols, and hatchery management practices for marine fish, tilapia, crustaceans, and abalone. Constraints to sustainable aquaculture development in the region include the availability of technology, seed supply, suitable feeds, disease management, and trained personnel.
Crabs are arthropods that have 10 legs, a hard shell, and walk sideways. There are almost 5,000 species of crabs, with 4,500 being true crabs and 500 being hermit crabs that steal shells from other animals for protection. The largest crab is the Japanese spider crab, with a leg span of up to 3.7 meters, while the largest land crab is the coconut crab with a leg span of up to 75 cm. Crabs are found on beaches around the world and come in various colors depending on the species.
Summary of the achievements of the four engineering teams that forms Core Development Group at Linaro during the one week long Linaro Connect USA 2014 (LCU14)
Advancing Reinaldo Gonsalves’ Model of Global Economic InsertionIan Walcott-Skinner
This paper is located in what is referred to as policy critique within the theoretical framework of International Political Economy (IPE) which, by origin, seeks to problematize issues of policy. In 1994, celebrated Brazilian economist, Reinaldo Gonsalves produced an important thesis and model on how to measure a country’s global insertion. At that time, Gonsalves could not have foreseen the influence of the Internet on global trade or on domestic trade policies. As such, the issue of global digital connectivity now presents itself as another pillar to measure global insertion. By examining regional Caribbean policy in this regard, this is an opportunity to advance Gonsalves’ model stimulate further on the opportunities associated with global digital connectivity.
The document discusses the Generic Connection Framework (GCF) in Java ME, which defines interfaces for network connections on CLDC platforms. GCF includes interfaces for stream-based, datagram-based, and content-based connections. Example implementations are provided for HTTP, TCP sockets, UDP datagrams, serial communications, and more.
Marcel Caraciolo is a scientist and CTO who has worked with Python for 7 years. He is interested in machine learning, mobile education, and data. He is the current president of the Python Brazil Association. Caraciolo has created several scientific Python packages and taught Python online. He is now working on applying Python to bioinformatics and clinical sequencing through tools like biopandas.
A talk to introduce Singularity Registry HPC, which allows you to install Singularity, Podman, or Docker containers (and others) as modules on an HPC system (e.g., LMOD or environment modules). Presented 2021.
Talk given at first OmniSci user conference where I discuss cooperating with open-source communities to ensure you get useful answers quickly from your data. I get a chance to introduce OpenTeams in this talk as well and discuss how it can help companies cooperate with communities.
Nicholas Schiller presented on using APIs to customize library services. He demonstrated how to build a web application using the WorldCat Search API that automatically adds Boolean search terms to a user's query and formats the results. The application was built with PHP for server-side scripting, HTML5 for interface design, and jQuery Mobile to optimize for different devices. The presentation provided examples of APIs, guidelines for API projects, and resources for further learning about APIs and programming.
This document summarizes an introduction to data analysis in Python using Wakari. It discusses why Python is a good language for data analysis, highlighting key Python packages like NumPy, Pandas, Matplotlib and IPython. It also introduces Wakari, a browser-based Python environment for collaborative data analysis and reproducible research. Wakari allows sharing of code, notebooks and data through a web link. The document recommends several talks at the PyData conference on efficient computing, machine learning and interactive plotting.
GitOps Core Concepts & Ways of Structuring Your ReposWeaveworks
Watch this talk on YouTube here: https://youtu.be/vLNZA_2Na_s
Whether you’re new to GitOps or a seasoned pro, this talk is for you! We'll start with the basics of how/where to get started, and then dive into one of the most asked GitOps questions: how to structure your repository!
During this talk, Scott & Pinky will review the Core Concepts of Flux including Git Sources, Reconciliation, Helm Releases, Kustomization, and Bootstrapping, to get you ramped up with how to think with a GitOps mindset! Then they’ll dive into and discuss considerations for and demo ways of structuring your repositories: monorepo, repo per environment, repo per team, or repo per app.
Resources:
- Flux on GitHub: https://github.com/fluxcd/flux2
- Flux docs: https://fluxcd.io/docs
- Core Concepts: https://fluxcd.io/docs/concepts/
- Sources: https://fluxcd.io/docs/components/source/
- Helm Releases: https://fluxcd.io/docs/guides/helmreleases/
- Kustomization: https://fluxcd.io/docs/components/kustomize/
Bootstrap: https://fluxcd.io/docs/installation/#bootstrap
- Ways of Structuring Your Repos: https://fluxcd.io/docs/guides/repository-structure/
Speaker Bios:
Priyanka “Pinky” Ravi is a Developer Experience Engineer at Weaveworks. She has worked on a multitude of topics including front end development, UI automation for testing and API development. Previously she was a software developer at a large insurance company where she was on the delivery engineering team working on GitOps enablement. She was instrumental in the multi-tenancy migration to utilize Flux for an internal Kubernetes offering. Outside of work, Priyanka enjoys hanging out with her husband and two rescue dogs as well as traveling around the globe.
Scott is a Brooklyn based interdisciplinary artist and Developer Advocate at Weaveworks. He co-founded the Basekamp art and research group in 1998 and the massively collaborative Plausible Artworlds international network. In technology he enjoys helping develop open source software that anyone can use, most recently projects in the cloud native landscape including co-maintaining Helm and Flux. In daily decisions, large or small, he tries to help make the world a better place for everyone.
A Year of Pyxley: My First Open Source AdventureNick Kridler
A quick introduction of Pyxley and the lessons learned over the last year of maintaining the package. Pyxley is a set of wrappers and helpers in Python that streamline the development of React.js based web applications driven by a Flask backend.
Slides de mi Conferencia: We Are Digital Puppets Actualizada (Inglés) que dicté en San Francisco CA. Hablo sobre el Tracking y el profiling de personas.
Deck used for my talk during PyDataNYC in which I described how we improved thumbnail cropping in our news app, Kamelio. We used Deep Learning object detection to identify the interesting regions of the image which was subsequently fed into image cropping logic.
This summary provides an overview of the key topics and speakers at the QCon Beijing conference on April 23-25. Some of the topics included Agile methodologies, Twitter architecture, JavaScript expert Douglas Crockford, Python web development, and more. Speakers would discuss Agile practices in China, how Twitter scales its infrastructure, Crockford's views on JavaScript and HTML5, Python frameworks like Flask and web.py, and techniques like test-driven development in Python. The conference aimed to cover a wide range of current technologies and approaches in software development.
This document discusses the importance of developers to higher education. It summarizes several projects funded by JISC that aim to support and connect developers, including Common Repositories Interfaces Group (CRIG), Wisdom of CRIG (WOCRIG), Developer Community Supporting Innovation (DevCSI), and Developer Days (dev8D). It argues that connecting developers leads to rapid innovation, knowledge transfer, and representation of developers' needs. Challenges include sustainability, perceptions of developers' value, and ensuring diversity among participants.
The quality of the python ecosystem - and how we can protect it!Bruno Rocha
The document discusses the quality of the Python ecosystem. It notes that while Python has many attractive qualities, the ecosystem faces challenges around library safety, documentation, and long-term maintainability. However, the community is the main driver of quality control. Improving tools for testing, verifying, and documenting libraries could help address issues. The responsibility of ecosystem quality ultimately lies with individual contributors collaborating openly through projects like PyPA to help evolve Python and PyPI.
Building web applications?
Thinking about auto-updater?
Need to document your releases?
Then look at this presentation.
You'll likely discover another point of view on these questions.
Let's say you're a data scientist, and you've been asked to build infrastructure. Here I've distilled some best practices as an introduction for people who are new to DevOps.
This document provides a summary of Jake VanderPlas' book "A Whirlwind Tour of Python". It introduces Python as a teaching and scripting language embraced by programmers, engineers, researchers, and data scientists. The book aims to provide a brief but comprehensive tour of the Python language for readers familiar with other languages, rather than starting from the basics. It covers Python's syntax, built-in types and data structures, functions, control flow, and other aspects to provide a foundation for exploring Python's data science ecosystem.
Collaborations in the Extreme: The rise of open code development in the scie...Kelle Cruz
Video: https://www.simonsfoundation.org/event/collaborations-in-the-extreme-the-rise-of-open-code-development-in-the-scientific-community/
The internet is changing the scientific landscape by fostering international, interdisciplinary and collaborative software development. More than ever before, software is a crucial component of any scientific result. The ability to easily share code is reshaping expectations about reproducibility -- a fundamental tenet of the scientific process. In this lecture, Kelle Cruz will briefly provide the backstory of how these shifts have come about, describe some of the most impactful open source projects, and discuss efforts currently underway aimed at ensuring these community-led projects are sustainable and receive support.
Data Science With Python | Python For Data Science | Python Data Science Cour...Simplilearn
This Data Science with Python presentation will help you understand what is Data Science, basics of Python for data analysis, why learn Python, how to install Python, Python libraries for data analysis, exploratory analysis using Pandas, introduction to series and dataframe, loan prediction problem, data wrangling using Pandas, building a predictive model using Scikit-Learn and implementing logistic regression model using Python. The aim of this video is to provide a comprehensive knowledge to beginners who are new to Python for data analysis. This video provides a comprehensive overview of basic concepts that you need to learn to use Python for data analysis. Now, let us understand how Python is used in Data Science for data analysis.
This Data Science with Python presentation will cover the following topics:
1. What is Data Science?
2. Basics of Python for data analysis
- Why learn Python?
- How to install Python?
3. Python libraries for data analysis
4. Exploratory analysis using Pandas
- Introduction to series and dataframe
- Loan prediction problem
5. Data wrangling using Pandas
6. Building a predictive model using Scikit-learn
- Logistic regression
This Data Science with Python course will establish your mastery of data science and analytics techniques using Python. With this Python for Data Science Course, you'll learn the essential concepts of Python programming and become an expert in data analytics, machine learning, data visualization, web scraping and natural language processing. Python is a required skill for many data science positions, so jumpstart your career with this interactive, hands-on course.
Why learn Data Science?
Data Scientists are being deployed in all kinds of industries, creating a huge demand for skilled professionals. Data scientist is the pinnacle rank in an analytics organization. Glassdoor has ranked data scientist first in the 25 Best Jobs for 2016, and good data scientists are scarce and in great demand. As a data you will be required to understand the business problem, design the analysis, collect and format the required data, apply algorithms or techniques using the correct tools, and finally make recommendations backed by data.
You can gain in-depth knowledge of Data Science by taking our Data Science with python certification training course. With Simplilearn Data Science certification training course, you will prepare for a career as a Data Scientist as you master all the concepts and techniques.
Learn more at: https://www.simplilearn.com
Six Principles of Software Design to Empower ScientistsDavid De Roure
Keynote talk for Workshop on Managing for Usability:
Challenges and Opportunities for E-Science Project Management, 10-11 April 2008,
OeRC, University of Oxford, UK
The 21st century; oh, what a time to be alive! With the world at your fingertips, it is easier than ever to dream big. But the question is- where to begin? With a wide range of programming languages to choose from to begin with, this article isn’t a gimmick for Python. Through this piece of writing, we hope to open you up to the realities of the world of Python. We will let you know the reasons why should I learn Python programming, what are the benefits of learning Python, what can I do with Python programming language and how can I start a career in Python Programming.
Este documento descreve como analisar seu próprio genoma usando tecnologias como Python. Apresenta os conceitos de sequenciamento de DNA, mapeamento, chamada de variantes e interpretação. Explica o fluxo de trabalho de um pipeline simples para analisar variantes em um genoma e fornece recursos para aprender mais sobre bioinformática.
Joblib: Lightweight pipelining for parallel jobs (v2)Marcel Caraciolo
This document discusses parallel computing in Python using joblib. It begins with an overview of different parallelization options in Python like threading and multiprocessing. It then discusses how joblib provides an easy way to parallelize Python code using multiprocessing without needing to explicitly manage processes. The document provides examples of using joblib to parallelize tasks like applying a function to a list of inputs and shows how it helps speed up computation by utilizing multiple CPU cores. It also discusses some considerations like interrupting jobs and memory usage when using joblib for parallelization.
Construindo softwares de bioinformática para análises clínicas : Desafios e...Marcel Caraciolo
O documento discute os desafios e oportunidades na construção de softwares de bioinformática para análises clínicas. Apresenta o laboratório Genomika, especializado em testes genéticos, e como a fusão de biologia molecular e tecnologia da informação é essencial para analisar grandes volumes de dados genéticos. Também destaca a importância da bioinformática para minerar bancos de dados na busca de mutações e como os sistemas de saúde podem ser aprimorados com novas tecnologias.
Como Python ajudou a automatizar o nosso laboratório v.2Marcel Caraciolo
O documento descreve como a linguagem de programação Python pode ser usada para automatizar tarefas em laboratórios de análise clínica, incluindo a análise de variantes genéticas, gestão de processos laboratoriais e infraestrutura de servidores. O autor também fornece recursos para quem deseja aprender bioinformática e trabalhar com análises genômicas usando Python.
Como Python pode ajudar na automação do seu laboratórioMarcel Caraciolo
O documento descreve como a linguagem Python pode ajudar a automatizar os processos de um laboratório de análises clínicas, incluindo a gestão de armazenamento e análise de grandes volumes de dados genômicos, o desenvolvimento de sistemas de gerenciamento laboratorial e notificações, e a infraestrutura de servidores e backups.
Este documento apresenta um tutorial sobre como hackear a web com Python 3 ministrado por Marcel Caraciolo. O tutorial introduz Python 3 e mostra como interagir com plataformas como Facebook, Reddit, MongoDB, Foursquare, Twitter e dados abertos usando a linguagem. O documento fornece links e códigos para que os participantes possam experimentar coletar e analisar dados dessas plataformas.
O documento discute vários modelos de negócios relacionados a software open-source, incluindo suporte e treinamento, consultoria em softwares open-source, Software as a Service, e venda de pacotes de software proprietário baseado em código-fonte open-source. O documento também fornece conselhos sobre como iniciar e manter projetos de código aberto.
Python Workshop on-line at Mutirao Python on-line via pycursos.com
https://www.youtube.com/watch?feature=player_embedded&v=DFh6l-h6-gw
Benchy, python framework for performance benchmarking of Python ScriptsMarcel Caraciolo
Benchy is a lightweight Python framework for performing benchmarks on code. It allows generating performance and memory usage graphs to compare different code implementations. Benchmarks can be written as objects and executed via a BenchmarkRunner to obtain results. Results are stored in a SQLite database and full reports can be generated in reStructuredText format. The framework aims to provide an easy way to integrate benchmarks into the development workflow.
O documento apresenta Python e 10 motivos para conhecer a linguagem, incluindo que é fácil de aprender, multi-paradigma, e usada por empresas como Google, Dropbox e Mozilla. Também discute como Python é expressiva e integra-se com outras linguagens como C/C++, .NET e MATLAB. Redes de apoio à comunidade Python no Brasil também são apresentadas.
GeoMapper, Python Script for Visualizing Data on Social Networks with Geo-loc...Marcel Caraciolo
This document describes a tool called GeoLocation Friends Visualizer that plots social network location data on a map. It was created by Marcel Caraciolo, a Python developer from Recife, Brazil who has been working with Python for 6 years. The tool and its source code are available on GitHub at a provided link.
Benchy: Lightweight framework for Performance Benchmarks Marcel Caraciolo
Benchy: Lightweight framework for Performance Benchmarks on Python Scripts.
Presented at XXVI Pernambuco Python User Group Meeting at Recife, Pernambuco, Brazil on 06.04.2013
Este documento apresenta Python como uma linguagem de programação interpretada, fácil de aprender e altamente produtiva que suporta paradigmas orientados a objetos, funcional e procedural. Apresenta exemplos básicos de código Python e discute como Python é usado por muitas grandes empresas, é de código aberto e possui uma comunidade ativa de desenvolvedores.
Construindo Soluções Científicas com Big Data & MapReduceMarcel Caraciolo
Este documento resume as principais informações sobre o uso de MapReduce e Big Data. Em três frases:
MapReduce é uma abordagem para processamento distribuído de grandes conjuntos de dados através de funções map e reduce. MrJob permite rodar trabalhos MapReduce em Python no Amazon EMR ou Hadoop de forma fácil. Exemplos mostram como usar MapReduce para recomendação de amigos em larga escala.
Como Python está mudando a forma de aprendizagem à distância no BrasilMarcel Caraciolo
1) Marcel Caraciolo é um cientista chefe e professor que usa Python para promover a educação à distância no Brasil.
2) Ele co-fundou o PyCursos.com, que oferece cursos on-line gratuitos de Python que atraíram centenas de alunos em várias cidades.
3) Dados mostram que abordagens interativas como exercícios on-line durante os vídeos melhoram o engajamento e desempenho dos alunos.
Novas Tendências para a Educação a Distância: Como reinventar a educação ?Marcel Caraciolo
Apresentação realizada durante a Conferência Talk a Bit em Junho/2012 e realizada durante o PET 2012 por Marcel Caraciolo.
Universidade Federal de Pernambuco, 2012
O documento descreve como construir um webcrawler para rastrear encomendas usando Python e expressões regulares. Primeiro, ele explica como fazer o download da página HTML com os dados de rastreamento e extrair o conteúdo. Depois, ele discute como analisar o HTML baixado usando expressões regulares para obter as informações de status e localização da encomenda.
O documento descreve como manipular arquivos ZIP em Python usando o módulo zipfile. É possível criar, ler, extrair arquivos e obter metadados de arquivos ZIP como nome, tamanho e data de modificação.
PyFoursquare is a Python wrapper for the Foursquare API that allows developers to easily access Foursquare data from their Python applications. It currently supports searching for places and retrieving place details, tips, and user information. The wrapper follows a similar architecture to Tweepy, representing each Foursquare entity as a model. Developers can authenticate their app, make API requests, and access results as model objects. The project is open source and the author welcomes contributions to support additional Foursquare entities and features.
Sistemas de Recomendação: Como funciona e Onde Se aplica?Marcel Caraciolo
This document discusses mobile recommender systems. It describes how recommender systems on mobile devices face challenges due to limitations of mobile contexts, such as location and processing capabilities. It presents the workflow and architecture of a mobile restaurant recommendation and navigation system. The system collects and analyzes location-based user data on the server side and provides personalized recommendations to users on their mobile clients. It discusses using context such as location, tags, and implicit feedback for recommendations on mobile.
Quantum Computing Quick Research Guide by Arthur MorganArthur Morgan
This is a Quick Research Guide (QRG).
QRGs include the following:
- A brief, high-level overview of the QRG topic.
- A milestone timeline for the QRG topic.
- Links to various free online resource materials to provide a deeper dive into the QRG topic.
- Conclusion and a recommendation for at least two books available in the SJPL system on the QRG topic.
QRGs planned for the series:
- Artificial Intelligence QRG
- Quantum Computing QRG
- Big Data Analytics QRG
- Spacecraft Guidance, Navigation & Control QRG (coming 2026)
- UK Home Computing & The Birth of ARM QRG (coming 2027)
Any questions or comments?
- Please contact Arthur Morgan at [email protected].
100% human made.
Generative Artificial Intelligence (GenAI) in BusinessDr. Tathagat Varma
My talk for the Indian School of Business (ISB) Emerging Leaders Program Cohort 9. In this talk, I discussed key issues around adoption of GenAI in business - benefits, opportunities and limitations. I also discussed how my research on Theory of Cognitive Chasms helps address some of these issues
Unlocking the Power of IVR: A Comprehensive Guidevikasascentbpo
Streamline customer service and reduce costs with an IVR solution. Learn how interactive voice response systems automate call handling, improve efficiency, and enhance customer experience.
TrsLabs - Fintech Product & Business ConsultingTrs Labs
Hybrid Growth Mandate Model with TrsLabs
Strategic Investments, Inorganic Growth, Business Model Pivoting are critical activities that business don't do/change everyday. In cases like this, it may benefit your business to choose a temporary external consultant.
An unbiased plan driven by clearcut deliverables, market dynamics and without the influence of your internal office equations empower business leaders to make right choices.
Getting things done within a budget within a timeframe is key to Growing Business - No matter whether you are a start-up or a big company
Talk to us & Unlock the competitive advantage
Social Media App Development Company-EmizenTechSteve Jonas
EmizenTech is a trusted Social Media App Development Company with 11+ years of experience in building engaging and feature-rich social platforms. Our team of skilled developers delivers custom social media apps tailored to your business goals and user expectations. We integrate real-time chat, video sharing, content feeds, notifications, and robust security features to ensure seamless user experiences. Whether you're creating a new platform or enhancing an existing one, we offer scalable solutions that support high performance and future growth. EmizenTech empowers businesses to connect users globally, boost engagement, and stay competitive in the digital social landscape.
Special Meetup Edition - TDX Bengaluru Meetup #52.pptxshyamraj55
We’re bringing the TDX energy to our community with 2 power-packed sessions:
🛠️ Workshop: MuleSoft for Agentforce
Explore the new version of our hands-on workshop featuring the latest Topic Center and API Catalog updates.
📄 Talk: Power Up Document Processing
Dive into smart automation with MuleSoft IDP, NLP, and Einstein AI for intelligent document workflows.
Vaibhav Gupta BAML: AI work flows without Hallucinationsjohn409870
Shipping Agents
Vaibhav Gupta
Cofounder @ Boundary
in/vaigup
boundaryml/baml
Imagine if every API call you made
failed only 5% of the time
boundaryml/baml
Imagine if every LLM call you made
failed only 5% of the time
boundaryml/baml
Imagine if every LLM call you made
failed only 5% of the time
boundaryml/baml
Fault tolerant systems are hard
but now everything must be
fault tolerant
boundaryml/baml
We need to change how we
think about these systems
Aaron Villalpando
Cofounder @ Boundary
Boundary
Combinator
boundaryml/baml
We used to write websites like this:
boundaryml/baml
But now we do this:
boundaryml/baml
Problems web dev had:
boundaryml/baml
Problems web dev had:
Strings. Strings everywhere.
boundaryml/baml
Problems web dev had:
Strings. Strings everywhere.
State management was impossible.
boundaryml/baml
Problems web dev had:
Strings. Strings everywhere.
State management was impossible.
Dynamic components? forget about it.
boundaryml/baml
Problems web dev had:
Strings. Strings everywhere.
State management was impossible.
Dynamic components? forget about it.
Reuse components? Good luck.
boundaryml/baml
Problems web dev had:
Strings. Strings everywhere.
State management was impossible.
Dynamic components? forget about it.
Reuse components? Good luck.
Iteration loops took minutes.
boundaryml/baml
Problems web dev had:
Strings. Strings everywhere.
State management was impossible.
Dynamic components? forget about it.
Reuse components? Good luck.
Iteration loops took minutes.
Low engineering rigor
boundaryml/baml
React added engineering rigor
boundaryml/baml
The syntax we use changes how we
think about problems
boundaryml/baml
We used to write agents like this:
boundaryml/baml
Problems agents have:
boundaryml/baml
Problems agents have:
Strings. Strings everywhere.
Context management is impossible.
Changing one thing breaks another.
New models come out all the time.
Iteration loops take minutes.
boundaryml/baml
Problems agents have:
Strings. Strings everywhere.
Context management is impossible.
Changing one thing breaks another.
New models come out all the time.
Iteration loops take minutes.
Low engineering rigor
boundaryml/baml
Agents need
the expressiveness of English,
but the structure of code
F*** You, Show Me The Prompt.
boundaryml/baml
<show don’t tell>
Less prompting +
More engineering
=
Reliability +
Maintainability
BAML
Sam
Greg Antonio
Chris
turned down
openai to join
ex-founder, one
of the earliest
BAML users
MIT PhD
20+ years in
compilers
made his own
database, 400k+
youtube views
Vaibhav Gupta
in/vaigup
[email protected]
boundaryml/baml
Thank you!
AI Changes Everything – Talk at Cardiff Metropolitan University, 29th April 2...Alan Dix
Talk at the final event of Data Fusion Dynamics: A Collaborative UK-Saudi Initiative in Cybersecurity and Artificial Intelligence funded by the British Council UK-Saudi Challenge Fund 2024, Cardiff Metropolitan University, 29th April 2025
https://alandix.com/academic/talks/CMet2025-AI-Changes-Everything/
Is AI just another technology, or does it fundamentally change the way we live and think?
Every technology has a direct impact with micro-ethical consequences, some good, some bad. However more profound are the ways in which some technologies reshape the very fabric of society with macro-ethical impacts. The invention of the stirrup revolutionised mounted combat, but as a side effect gave rise to the feudal system, which still shapes politics today. The internal combustion engine offers personal freedom and creates pollution, but has also transformed the nature of urban planning and international trade. When we look at AI the micro-ethical issues, such as bias, are most obvious, but the macro-ethical challenges may be greater.
At a micro-ethical level AI has the potential to deepen social, ethnic and gender bias, issues I have warned about since the early 1990s! It is also being used increasingly on the battlefield. However, it also offers amazing opportunities in health and educations, as the recent Nobel prizes for the developers of AlphaFold illustrate. More radically, the need to encode ethics acts as a mirror to surface essential ethical problems and conflicts.
At the macro-ethical level, by the early 2000s digital technology had already begun to undermine sovereignty (e.g. gambling), market economics (through network effects and emergent monopolies), and the very meaning of money. Modern AI is the child of big data, big computation and ultimately big business, intensifying the inherent tendency of digital technology to concentrate power. AI is already unravelling the fundamentals of the social, political and economic world around us, but this is a world that needs radical reimagining to overcome the global environmental and human challenges that confront us. Our challenge is whether to let the threads fall as they may, or to use them to weave a better future.
IT help desk outsourcing Services can assist with that by offering availability for customers and address their IT issue promptly without breaking the bank.
#StandardsGoals for 2025: Standards & certification roundup - Tech Forum 2025BookNet Canada
Book industry standards are evolving rapidly. In the first part of this session, we’ll share an overview of key developments from 2024 and the early months of 2025. Then, BookNet’s resident standards expert, Tom Richardson, and CEO, Lauren Stewart, have a forward-looking conversation about what’s next.
Link to recording, transcript, and accompanying resource: https://bnctechforum.ca/sessions/standardsgoals-for-2025-standards-certification-roundup/
Presented by BookNet Canada on May 6, 2025 with support from the Department of Canadian Heritage.
Technology Trends in 2025: AI and Big Data AnalyticsInData Labs
At InData Labs, we have been keeping an ear to the ground, looking out for AI-enabled digital transformation trends coming our way in 2025. Our report will provide a look into the technology landscape of the future, including:
-Artificial Intelligence Market Overview
-Strategies for AI Adoption in 2025
-Anticipated drivers of AI adoption and transformative technologies
-Benefits of AI and Big data for your business
-Tips on how to prepare your business for innovation
-AI and data privacy: Strategies for securing data privacy in AI models, etc.
Download your free copy nowand implement the key findings to improve your business.
This is the keynote of the Into the Box conference, highlighting the release of the BoxLang JVM language, its key enhancements, and its vision for the future.
Increasing Retail Store Efficiency How can Planograms Save Time and Money.pptxAnoop Ashok
In today's fast-paced retail environment, efficiency is key. Every minute counts, and every penny matters. One tool that can significantly boost your store's efficiency is a well-executed planogram. These visual merchandising blueprints not only enhance store layouts but also save time and money in the process.
Increasing Retail Store Efficiency How can Planograms Save Time and Money.pptxAnoop Ashok
Introduction to Crab - Python Framework for Building Recommender Systems
1. Crab
A Python Framework for Building
Recommendation Engines
Scipy 2011, Austin TX
Marcel Caraciolo Ricardo Caspirro Bruno Melo
@marcelcaraciolo @ricardocaspirro @brunomelo
1
2. What is Crab ?
A python framework for building recommendation engines
A Scikit module for collaborative, content and hybrid filtering
Mahout Alternative for Python Developers :D
Open-Source under the BSD license
https://github.com/muricoca/crab
2
3. When started ?
It began one year ago
Community-driven, 4 members
Since April,2011 the open-source labs Muriçoca incorporated it
Since April,2011 rewritting it as Scikit
https://github.com/muricoca/
3
4. Knowing Scikits
Scikits are Scipy Toolkits - independent and projects hosted
under a common namespace.
Scikits Image
Scikits MlabWrap
Scikits AudioLab
Scikit Learn
....
http://scikits.appspot.com/scikits
4
5. Knowing Scikits
Scikit-Learn
Machine Learning Algorithms + scientific Python packages
(Numpy, Scipy and Matplotlib)
http://scikit-learn.sourceforge.net/
Our goal: Incorporate the Crab as Scikit and incorporate
some parts of them at Scikit-learn
5
7. Why Recommendations
* +,&-.$/).#&0#/"1.#$%234(".# ?
$/)#5(&6 7&.2.#"$4,#)$8
We are overloaded
* 93((3&/.#&0#:&'3".;#5&&<.#
$/)#:-.34#2%$4<.#&/(3/"
Thousands of news articles and blog posts each day
* =/#>$/&3;#?#@A#+B#4,$//"(.;#
2,&-.$/).#&0#7%&6%$:.#
Millions of movies, books and music tracks online
"$4,#)$8
Several Places, Offers and Events
* =/#C"1#D&%<;#."'"%$(#
Even Friends sometimes we are overloaded !
2,&-.$/).#&0#$)#:"..$6".#
."/2#2&#-.#7"%#)$8
7
8. Why Recommendations ?
We really need and consume only a few of them!
“A lot of times, people don’t know what
they want until you show it to them.”
Steve Jobs
“We are leaving the Information age, and
entering into the Recommendation age.”
Chris Anderson, from book Long Tail
8
9. Why Recommendations ?
Can Google help ?
Yes, but only when we really know what we are looking for
But, what’s does it mean by “interesting” ?
Can Facebook help ?
Yes, I tend to find my friends’ stuffs interesting
What if i had only few friends and what they like do not always
attract me ?
Can experts help ?
Yes, but it won’t scale well.
But it is what they like, not me! Exactly same advice!
9
10. Why Recommendations ?
Recommendation Systems
Systems designed to recommend to me something I may like
10
12. The current Crab
Collaborative Filtering algorithms
User-Based, Item-Based and Slope One
Evaluation of the Recommender Algorithms
Precision, Recall, F1-Score, RMSE
Precision-Recall Charts
12
15. The current Crab
Using REST APIs to deploy the recommender
django-piston, django-rest, django-tastypie
15
16. Crab is already in production
Brazilian Social Network called Atepassar.com
Educational network with more than 60.000 students and 3000 video-classes
Running on Python
+ Numpy + Scipy and
Django
Backend for Recommendations
MongoDB - mongoengine
Daily Recommendations
with Explanations
16
17. Evaluating your recommender
Crab implements the most used recommender metrics.
Precision, Recall, F1-Score, RMSE
Using matplotlib
for a plotter utility
Implement new metrics
Simulations support maybe (??)
17
19. Distributing the recommendation computations
Use Hadoop and Map-Reduce intensively
Investigating the Yelp mrjob framework https://github.com/pfig/mrjob
Develop the Netflix and novel standard-of-the-art used
Matrix Factorization, Singular Value Decomposition (SVD), Boltzman machines
The most commonly used is Slope One technique.
19
20. Why migrate ?
Old Crab running only using Pure Python
Recommendations demand heavy maths calculations and lots of processing
Compatible with Numpy and Scipy libraries
High Standard and popular scientific libraries optimized for scientific calculations in Python
Scikits projects are amazing!
Active Communities, Scientific Conferences and updated projects (e.g. scikit-learn)
Turn the Crab framework visible for the community
Join the scientific researchers and machine learning developers around the Globe coding with
Python to help us in this project
Be Fast and Furious
20
22. How are we working ?
Sprints, Online Discussions and Issues
https://github.com/muricoca/crab/wiki/UpcomingEvents
22
23. Future Releases
Planned Release 0.1
Collaborative Filtering Algorithms working, sample datasets to load and test
Planned Release 0.11
Evaluation of Recommendation Algorithms and Database Models support
Planned Release 0.12
Recommendation as Services with REST APIs
....
23
24. Join us!
1. Read our Wiki Page
https://github.com/muricoca/crab/wiki/Developer-Resources
2. Check out our current sprints and open issues
https://github.com/muricoca/crab/issues
3. Forks, Pull Requests mandatory
4. Join us at irc.freenode.net #muricoca or at our
discussion list
http://groups.google.com/group/scikit-crab
24
25. Crab
A Python Framework for Building
Recommendation Engines
https://github.com/muricoca/crab
Marcel Caraciolo Ricardo Caspirro Bruno Melo
@marcelcaraciolo @ricardocaspirro @brunomelo
{marcel, ricardo,bruno}@muricoca.com
25