Posts with the tag: Data Collection

Etherium Token Recommender

Read in 1 minute ·

Etherium Token Recommender When using Ethereum, users hold a unique wallet, within each wallet the user holds tokens that seem attractive to the user. Project Goal Generate top 5 relevant tokens based on collaborative filtering from the Ethereum Blockchain Genesis. Tools PySpark DataBricks Delta Lake S3 MLFlow Github Project Link

Posts

Read in 0 minute ·

Graduation Rate Predictor

Read in 1 minute ·

Graduation Rate Predictor The aim of this project is to model the way government expenditures and labor appropriation impacts secondary education graduation rates in New York State Public Schools. Our experiments show that diminishing returns are not present in funding, rather the educational staff’s quality affects graduation rates. Our highest performing model predicted graduation rate SVR with a median squared error of 3.863. Github Project Link

Amazon Recommendation Classification

Read in 1 minute ·

Amazon Recommendation Classification Amazon curates the buying experience for each user utilizing advanced algorithms and frequent item-set techniques to drive revenue. In addition to recommendation algorithms, pessimistic or interested buyers will consult the reviews posted below a product to gauge whether the product is a “smart” purchase. Our goal is to accurately classify the review score as function of review summary and text. We utilized NLP techniques, such as Non-Negative Matrix Factorization (NMF), Latent Dirichlet Allocation (LDA), and Term Frequency–Inverse Document Frequency (TF-IDF) to classify the Amazon reviews.

Congressional Tweet Classification

Read in 1 minute ·

Congressional Tweet Classification We extract, transform, and analyze over 857,000 records to classify a tweet’s owner as a Democrat or Republican. We utilized the Logistic regression technique that exhibited 88.884 percent accuracy. We conclude that a tweet’s content can reveal the owner as Democrat or Republican. Github Project Link

Ishida Checkweigher

Read in 1 minute ·

Ishida Checkweigher DACS-G Ishida Checkweigher can send data through Modbus and RS-232 connections. The script is intended to be utilized on a Raspberry Pi, which can communicate with RS-232. Capital Requirements: Raspberry Pi 2 or greater generation Male USB to Male RS-232 port Female to Female RS-232 gender converter Female/Male Null modem adapter Scheduling Scrapping Script Cron is a tool for configuring scheduled tasks on Unix systems. Find cron documentation for implementation.

DS Smith Scrapping

Read in 1 minute ·

A dieline is a graphic design as a placeholder for assisting in the proper layout of a document that will be diecut as part of the finishing process. A packaging engineer asked if a script can be written to load the dieline files, view the files, and archive pertinent information. The answer is YES! The script locates the dielines, copies the files to a local directory for analysis, and uses index analysis to document necessary information.

Alpha Live Collection

Read in 1 minute ·

Alpha Live Data Capital Requirements OPC Server Utilized RsLinx OPC Server PC and DDE clients are supported for any number of devices. It also supports applications developed for the RSLinx Classic C API. But note that this is limited to 32bit client only. Additional information is found within Rockwell's documentation. Other OPC servers may be utilized, but have not been tested. Matrikon OPC Cyberlogic Graybox Computer System configure with Python 3 or later Database SQL Express Oracle Microsoft Access Implementation Identify the OPC server’s IP address, all OPC tags should be hosted on the OPC server.

Email Data Export

Read in 1 minute ·

A simplistic script that exports queries an SQL database, defines the data in a pandas DataFrame, exports the DataFrame to an Excel document, and attaches the Excel export to a gmail. The consultant was unable to retrieve the data from remote locations; therefore, the recurring email was advantageous. Github Project Link

Apartments.com Scrapping

Read in 1 minute ·

Shopping for an Apartment? This python script will retrieve the searches from aparments.com & return a data frame. Call the function with a single URL or a list of URLs, happy shopping! Required Modules Scraping Functions BeautifulSoup, pandas, & requests. Personal Execution Example Seaborn, apartment_scrap, matplotlib, & matplotlib.pyplot. Github Project Link